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PNEUMOCOCCAL GENES, PORTIONS THEREOF, 
EXPRESSION PRODUCTS THEREFROM, 
AND USES OF SUCH GENES, PORTIONS AND PRODUCTS 

RBIATED APPLICATIONS 

This application is a continuation-in-part ("CIP"): of 
application Serial Nos. 08,529,055, filed September 15, 1995, 
08/226,844, filed May 29, 1992, 08/093,907, filed May 29, 1992, 
07/884,918, filed July 5, 1994 (corresponding to PCT/US93/05191) ; 
of application Serial No. 08/482,981, filed June 7, 1995; of 
application Serial No. 08/458,399, filed June 2, 1995; of 
application Serial No. 08/446,201, filed May 19, 1995 (as a CIP 
of USSN 08/246,636); of application Serial No. 08/246,636, filed 
May 20, 1994 (as a CIP of USSN 08/048,896, filed April 20, 1993 
as a CIP of USSN 07/835,698, filed February 12, 1992 as a CIP of 
USSN 07/656,773); of application Serial 08/319,795, filed October 
7, 1994 (as a CIP of USSN 08/246,636); of application Serial No. 
08/072,070, filed June 3, 1993; of application Serial No. 
07/656,773, filed February 15, 1991 (USSN 656,773 and 835,698 
corresponding to Int'l application WO 92/1448); and, each of 
these applications, as well as each application, document^ or 
reference cited in these applications, is hereby incorporated 
herein by reference- Documents or references are also cited in 
the following text, either in a Reference List appended to 
certain Examples, or before the claims, or in the text itself; 
and, each of these documents or references is hereby expressly 
incorporated herein by reference. 
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FIELD OF THE INVENTION 

This invention relates to pneumococcal genes, portions 
thereof, expression products therefrom and uses of such genes, 
portions and products; especially to genes of Streptococcus 
pneumoniae, e.g., the gene encoding pneumococcal surface protein 
A (PspA) (said gene being "pspA") . pspA-like genes, pneumococcal 
surface protein C (PspC) (said gene being "pspC") , portions of 
such genes, expression products therefrom, and the uses of such 
genes, portions thereof and expression products therefrom. Such 
uses include uses of the genes and portions thereof for obtaining 
expression products by recombinant techniques, as well as for 
detecting the presence of Streptococcus pneumoniae or strains 
thereof by detecting DNA thereof by hybridization or 
amplification (e.g., PCR) and hybridization techniques (e.g., 
obtaining DNA-containing sample, contacting same with genes or 
fragment under PCR, amplification and/or hybridization 
conditions, and detecting presence of or isolating hybrid or 
amplified product) . The expression product uses include use in 
preparing antigenic, immunological or vaccine compositions, for 
eliciting antibodies, an immunological response (other than or 
additional to antibodies) or a protective response (including 
antibody or other immunological response by administering 
composition to a suitable host); or, the expression product can 
be for use in detecting the presence of Streptococcus pneumoniae 
by detecting antibodies to Streptococcus pneumoniae protein (s) or 
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antibodies to a portion thereof in a host, e,g., by obtaining an 
antibody-containing sample from a relevant host, contacting the 
sample with expression product and detecting binding (for 
instance by having the product labeled); and, the antibodies 
generated by the aforementioned compositions are useful in 
diagnostic or detection kits or assays. Thus, the invention 
relates to varied compositions of matter and methods for use 
thereof . 

BACKGROUND OF THE INVENTION 

Streptococcus pneumoniae is an important cause of 
otitis media, meningitis, bacteremia and pneumonia. Despite the 
use of antibiotics and vaccines, the prevalence of pneumococcal 
infections has declined little over the last twenty-five years* 

It is generally accepted that immunity to Streptococcus 
pneumoniae can be mediated by specific antibodies against the 
polysaccharide capsule of the pneumococcus • However, neonates 
and young children fail to make an immune response against 
polysaccharide antigens and can have repeated infections 
involving the same capsular serotype. 

One approach to immunizing infants against a number of 
encapsulated bacteria is to conjugate the capsular polysaccharide 
antigens to protein to make them immunogenic. This approach has 
been successful, for example, with Haemophilus influenzae b (see 
U.S, Patent no. 4,496,538 to Gordon and U.S. Patent no. 4,673,574 
to Anderson) . However, there are over eighty known capsular 
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serotypes of S. pnsumoniaB of which twenty-three account for most 
of the disease. For a pneumococcal polysaccharide -protein 
conjugate to be successful, the capsular types responsible for 
most pneumococcal infections would have to be made adequately 
immunogenic. This approach may be difficult, because the twenty- 
three polysaccharides included in the presently-available vaccine 
are not all adequately immunogenic, even in adults. 

An alternative approach for protecting children, and 
also the elderly, from pneumococcal infection would be to 
identify protein antigens that could elicit protective immune 
responses. such proteins may serve as a vaccine by themselves, 
may be used in conjunction with successful polysaccharide-protein 
conjugates, or as carriers for polysaccharides. 

McDaniel et al. (I) , j. Exp. Med. 160:386-397, 1984, 
relates to the. production of hybridoma antibodies that recognize 
cell surface polypeptide (s) on S. pneumoniaB and protection of 
mice from infection with certain strains of encapsulated 
pneumococci by such antibodies. This surface protein antigen has 
been termed "pneumococcal surface protein A" or PspA for short. 

McDaniel et al. (li) , Microbial Pathogenesis 1:519-531, 
1986, relates to studies on the characterization of the PspA. 
considerable diversity in the PspA molecule in different strains 
was found, as were differences in the epitopes recognized by 
different antibodies. 
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McDaniel et al, (III), Exp. Med. 165:381-394, 1987, 
relates to immunization of X-linked immunodef icient (XID) mice 
with non-encapsulated pneumococci expressing PspA, but not 
isogenic pneumococci lacking PspA, protects mice from subsequent 
fatal infection with pneumococci. 

McDaniel et al. (IV), Infect. Iramun. , 59:222-228, 1991, 
relates to immunization of mice with a recombinant full length 
fragment of PspA that is able to elicit protection against 
pneumococcal strains of capsular types 6A and 3. 

Grain et al. Infect. Immun. , 56:3293-3299, 1990, relates 
to a rabbit antiserum that detects PspA in 100% (n = 95) of 
clinical and laboratory isolates of strains of S. pneumoniae. 
When reacted with seven monoclonal antibodies to PspA, fifty- 
seven S. pneumoniae isolates exhibited thirty-one different 
patterns of reactivity. 

The PspA protein type is independent of capsular type. 
It would seem that genetic mutation or exchange in the 
environment has allowed for the development of a large pool of 
strains which are highly diverse with respect to capsule, PspA, 
and possibly other molecules with variable structures. 
Variability of PspA's from different strains also is evident in 
their molecular weights, which range from 67 to 99 kD. The 
observed differences are stably inherited and are not the result 
of protein degradation. 
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Immunization with a partially purified PspA from a 
recombinant A gtll clone, elicited protection against challenge 
with several 5. pneumoniae strains representing different 
capsular and PspA types, as described in McDaniel et al. (IV), 
Infect. Immun. 59:222-228, 1991. Although clones expressing PspA 
were constructed according to that paper, the product was 
insoluble and isolation from cell fragments following lysis was 
not possible. 

While the protein is variable in structure between 
different pneumococcal strains , numerous cross-reactions exist 
between all PspA's, suggesting that sufficient common epitopes 
may be present to allow a single PspA or at least a small number 
of PspA's to elicit protection against a large number of s. 
pneumoniae strains. 

In addition to the published literature specifically 
referred to above, the inventors, in conjunction with co-workers, 
have published further details concerning PspA's, as follows: 

1. Abstracts of 89th Annual Meeting of the American 
Society for Microbiology, p. 125, item D-257, May 
1989; 

2. Abstracts of 90th Annual Meeting of the American 
Society for Microbiology, p. 98, item D-106, May 
1990; 

3. Abstracts of 3rd International ASM Conference on 
Streptococcal Genetics, p. ii, item 12, June 1990; 

4. Talkington et al, Infect. Immun. 59:1285-1289, 
1991; 
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5. Yother et al (I), J. Bacteriol, 174:601-609, 1992; 
and 

6. Yother et al (11), J. Bacteriol. 174:610-618, 
1992. 

7. McDaniel et al (V), Microbiol, Pathogenesis, 
13:261-268» 

It would be useful to provide PspA or fragments thereof 
in compositions, including PspA's or fragments from varying 
strains in such compositions, to provide antigenic, immunological 
or vaccine compositions; and, it is even further useful to show 
that the various strains can be grouped or typed, thereby 
providing a basis for cross-reactivities of PspA's or fragments 
thereof, and thus providing a means for determining which strains 
to represent in such compositions (as well as how to test for, 
detect or diagnose one strain from another) . 

Further, it would be advantageous to provide a pspA - 
like gene or a pspC gene in certain strains, as well as primers 
(oligonucleotides) for identification of such a gene, as well as 
of conserved regions in that gene and in pspA; for instance, for 
detecting, determining, isolating, or diagnosing strains of S. 
pneumonia. These uses and advantages, it is believed, have not 
heretofore been provided in the art. 
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OBJECT S AND SUMMARY OF THE INVENTTnu 

The invention provides an isolated amino acid molecule 
comprising residues 1 to lis, l to 260, 192 to 588, 192 to 299, 
or residues 192 to 260 of pneumococcal surface protein A of 
Streptococcus pneumoniae. 

The invention further provides an isolated DNA molecule 
comprising a fragment of a pneumococcal surface protein A gene of 
Streptococcus pneumoniae encoding the isolated amino acid 
molecule. 

The invention also provides PCR primers or 
hybridization probes comprising the isolated DNA molecule. 

The invention additionally provides an antigenic, 
vaccine or immunological composition comprising the amino acid 
molecule. 

The invention includes an isolated DNA molecule 
comprising nucleotides 1 to 26, 1967 to 1990, 161 to 187, 1093 to 
1117, or 1312 to 1331 or 1333 to 1355 of a pneumococcal surface 
protein A gene of Streptococcus pneumoniae. The DNA molecule can 
be used as a PCR primer or hybridization probe; and therefore the 
invention comprehends a PCR primer or hybridization probe 
comprising the isolated DNA molecule. 

The invention also includes an isolated DNA molecule 
comprising a fragment having homology with a portion of a 
pneumococcal surface protein A gene of Streptococcus pneumoniae. 
The DNA preferably is the following (which include the portion 
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having homology and restriction sites, and selection of other 

restriction sites or sequences for such DNA is within the ambit 

of the skilled artisan from this disclosure) : 

CCGGATCCAGCTCCTGCACCAAAAAC ; 
GCGCGTCGACGGCTTAAACCCATTCACCATTGG ; 
CCGGATCCTGAGCCAGAGCAGTTGGCTG ; 
CCGGATCCGCTCAAAGAGATTGATGAGTCTG ; 
GCGGATCCCGTAGCCAGTCAGTCTAAAGCTG ; 
CTGAGTCGACTGGAGTTTCTGGAGCTGGAGC ; 
CCGGATCCAGCTCCAGCTCCAGAAACTCCAG ; 
GCGGATGCTTGACCAATATTTACGGAGGAGGC ; 
GTTTTTGGTGCAGGAGCTGG; * 
GCTATGGGCTACAGGTTG ; 
CCACCTGTAGCCATAGC ; 

CCGCATCCAGCGTGCCTATCTTAGGGGCTGGTT ; and 
GCAAGCTTATGATATAGAAATTTGTAAC 

(thus, the invention broadly comprehends DNA homologous to 

portions of pspA; preferably further including restriction 

sequences) . 

These DNA molecules can be used as PCR primers or 
probes; and thus, the invention comprehends a primer or probe 
comprising and of these molecules. 

The invention further still provides PCR probe (s) which 
distinguishes between pspA and pspA-like nucleotide sequence, as 
well as PCR probe (s) which hybridizes to both pspA and pspA-like 
nucleotide sequences » 

Additionally, the invention includes a PspA extract 
prepared by a process comprising: growing pneumococci in a first 
medium containing choline chloride, eluting live pneumococci with 
a choline chloride containing salt solution, and growing the 
pneumococci in a second medium containing an alkanolamine and 
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substantially no choline; as well as a PspA extract prepared by 
that process and further comprising purifying PspA by isolation 
on a choline-Sepharose affinity column. These processes are also 
included in the invention. 

An immunological composition comprising thses extracts 
is comprehended by the invention, as well as an immunological 
composition comprising the full length PspA. 

A method for enhancing the immunogenicity of a PspA- 
containing immunological composition comprising, in said 
composition, the C-terminal portion of PspA, is additionally 
comprehended, as well. 

An immunological composition comprising at least two 
PspAs. The latter immunological composition can have the PspAs 
from different groups or families; the groups or families can be 
based on RFLP or sequence studies (see, e.g.. Fig. 13). 

Further, the invention provides an isolated amino acid 
molecule comprising pneumococcal surface protein C, Pspc, of 
Streptococcus pneumoniae having an alpha-helical, proline rich 
and repeat regions, an isolated DNA molecule comprising a 
pneumcoccal surface protein C gene encoding the aforementioned 
Pspc, and primers and hybrization probes consisting essentially 
of the isolated DNA molecule. 

Still further, an isolated amino acid molecule 
comprising pneumococcal surface protein C, PspC, of Streptococcus 
pneumoniae is provided, having an alpha-helical, proline rich and 
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repeat regions, having substantial homology with a protection 
eliciting region of PspA, and an isolated DNA molecule comprising 
a pneumcoccal surface protein C gene encoding the aforementioned 
PspC, and primers and hybrization probes consisting essentially 
of the isolated DNA molecule are provided by the present 
invention. 

Additionally, the present invention provides 
immunological compositions comprising PspC. 

These and other embodiments are disclosed or are 
obvious from the following detailed description. 
BRIEF DESCRIPTION OF THE FIGURES 

Figures lA and IB show: Evaluation of digested plasmid 
constructs , Fig. lA: 1% agarose gel electrophoresis of plasmids 
isolated from transformed E. coli BL21(DE3) strains stained with 
ethidium bromide. Lane 1: 1 kb DNA ladder (sizes noted in kb) , 
lane 2: pRCT125; lane 3: pRClOS, lane 4: DBL5 pspA insert, lane 
5: PRCT113, lane 6: BG9739 pspA insert, lane 7: pRCTll?, and lane 
8: LB1905 pspA insert. Fig, IB: Corresponding Southern blot of 
gel in Fig. lA probed with full-length Rxl pspA and hybridization 
detected as described in Example 1. The arrow indicates the 1.2 
kb pspA digested inserts from plasmid constructs and the PCR- 
amplified pspA fragments from the pneumococcal donor strains used 
in cloning. 

Figure 2 shows: Evaluation of strain RCT105 cell 
fractions containing truncated DBL5 PspA . Proteins from £. coli 
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cell fractions were resolved by 10% SDS-PAGE, transferred to NC, 
and probed with MAb XiR278. Lane 1: molecular weight markers 
(noted in kDa) , lane 2: full-length, native DBL5 PspA, lane 3: 
uninduced cells, lanes 4-6: induced cells; 1 hr, 2 hr, and 3 hr 
of IPTG induction respectively, lane 7: periplasmic proteins, 
lane 8: cytoplasmic proteins, and lane 9: insoluble cell 
wall/membrane material. 

Figure 3 shows: SDS-PAGE of R36A PspA (80 ng) column 
isolated from CDM-ET and an equal volume of an equivalent WG44.1 
prep. Identical gels are shown stained with Bio-Rad silver kit 
(A) or immunoblotted with PspA MAb XiR278(B). The PspA isolated 
from R36A shows the characteristic monomer (84 kDa) and dimer 
bands. 

Figure 4 shows: Cell lysates of pneumococcal isolates 
MC27 and MC28 were subjected to SDS-PAGE and transferred to 
nitrocellulose for Western blotting with seven MAb to PspA. 7D2 
detected a protein of 82 kDa in each isolate and XiR278 and 2A4 
detected a protein of 190 kDa in each isolate. MAb Xi64, Xii26, 
1A4 and SR4W4 were not reactive. Strains MC25 and MC26 yielded 
identical results. 

Figure 5 (Figs. 5A and 5B) shows: Southern blot of 
Hind III digest of MC25-MC28 chromosomal DNA developed at a 
stringency greater than 95 percent. A digest of Rxl DNA was used 
as a comparison. The blot was probed with LSMpspA13/2, a full 
length Rxl probe (Fig. 5) and LSMpspA12/6 a 5' probe of Rxl pspA 
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(Fig. 5) . The same concentration of Rxl DNA was used in both 
panels, but the concentrations of MC25-MC28 DNA in Fig. 5B were 
half that used in Fig. 5A to avoid detection of partial digests. 

Figure 6 shows: RFLP of amplified pspA. PspA from MC25 
was amplified by PGR using 5' and 3' primers for pspA (LSM13 and 
LSM, respectively) . The amplified DNA was digested with 
individual restriction endonucleases prior to electrophoresis and 
staining with ethidium bromide. Lane 1 BclT, Lane 2 BAMHI, Lane 
3 BstNI, Lane 4 PstI, Lane 5 Sad, Lane 6 EcoRL, Lane 7 Sjnal, 
Lane 8 JCpnI. 

Figure 7 shows: A depiction of PspA showing the 
relative location and orientation of the oligonucleotides. 

Figure 8 shows: Derivatives of the pneumoniae D39- 
Rxl family. 

Figures 9 to 10 show: Electrophoresis of pspA or 
amplified pspA product with Hhal (Fig. 9), Sau3AI (Fig. 10). 

Figure 11 shows: RFLP pattern of two isolates from six 

families. 

Figure 12 shows: RFLP pattern of two isolates from six 
families (using products from amplification with SKH2 and LSM13) . 

Figure 13 shows: Sequence primarily in the N-terminal 
half of PspA. 

Figure 14 shows: Cell lysates of pneumococcal isolates 
MC27 and MC2B, subjected to SDS-PAGE and Western blotting with 
seven MAbs to PspA; 7D2 detected a protein of 82 kOa in each 
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isolate, and Xi278 and 2A4 detected a protein of 190 kDa in each 
isolate; MAbs Xi64 , Xil26, 1A4 and SR4W4 were not reactive; 
strains MC25 and MC26 yielded identical results (not shown) . 

Figure 15A and 15B show: a Southern blot of Hind III 
digest of MC25-28 chromosomal DNA, using a digest of Rxl DNA as a 
comparison; the blot was probed with LSMpspA13/2, a full length 
Rxl probe (A), and LSMpspA12/6, a 5' probe of Rxl pspA (B) ; the 
same concentration of Rxl DNA was used in both panels, but the 
concentrations of MC25-28 DNA in B were half that used in A to 
avoid detection of partial digests. 

Figures 15C and 15D show: the nucleotide sequences of 
primers LSM13, LSM2, LSM12 and LSM6, and that of probes 
LSMpspA13/2 and LSMpspA12/6. 

Figure 16 shows: RFLP of amplified pspA, wherein PspA 
from MC25 was amplified by PCR using 5' and 3' primers for pspA 
(LSM13 and LSM2, respectively); the amplified DNA was digested 
with individual restriction endonucleases prior to 
electrophoresis and staining with ethidium bromide; Bel I was 
used in lane l; BamH 1 was used in lane 2; BstN I was used in 
lane 3; Pst I was used in lane 4; Sac I was used in lane 5; EcoR 
I was used in lane 6; Sma I was used in lane 7; and Kpn 1 was 
used in lane a. 

Figure 17 shows: position and orientation of 
oligonucleotides relative to domains encoded by pspA; numbers 
along the bottom of the Figure represent amino acids in the 
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0 

nature PspA polypeptide from strain Rxl, and arrows represent the 
relative position (not to scale) and orientation of 
oligonucleotides • 

Figure 18 shows: a restriction map of the pZero vector, 

Figure 19 shows: the nucleotide sequences of SKH2, 
LSM13, N192 and C588. 

Figure 20 shows: a comparison of the structural motifs 
of PspA and PspC; PspA has a smaller alpha-helical region, and 
does not contain the direct repeats within the alpha-helix 
(indicated by the dashed lines) ; the alpha-helical regions which 
are homologous between PspA and PspC are indicated by the dashed 
lines) ; the alpha-helical regions which are homologous between 
PspA and PspC are indicated by the striped pattern; and PCR 
primers are indicated by the arrows* 

Figure 21 shows: the amino acid and nucleotide 
sequence of PspC, wherein the putative -10 and -35 regions are 
underlined, and the ribosomal binding site is in lower case. 

Figure 22 shows: the Bestfit analysis of PspA and 
PspC; percent identity is 69% and percent similarity is 77%; 
amino acids of PspA are one the bottom line (1-588) and amino 
acids of PspC are on the top line (24 9-891) , and a dashed line 
indicated identity • 

Figure 23 shows: the coiled coil motif of the alpha- 
helix of PspC; amino acids that are not in the coiled coil motif 
are in the right column. 
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Figure 24 shows: a matrix plot comparison of the 
repeat regions of the alpha-helical region of PspC. 

Figure 25 shows: the sequence of the alpha helical and 
proline regions of LXS532 (Pspc.D39). 

Figure 26 shows: a comparison of nucleotides of 
pspA.Rxl to pspC. D39. 

Figure 27 shows: a BESTFIT analysis of pspC. EF6797 and 

pspC.D39. 

Figure 28 shows: the amino acid comparison of PspC of 
EF6797 and D39. 

Figure 29 shows: the amino acid comparison of PspC.D39 
and PspA.Rxl. 

DETAII.E D DESGRIPTTQW 

Knowledge of and familiarity with the applications 
incorporated herein by reference is assumed; and, those 
applications disclose the sequence of pspA as well as certain 
portions thereof, and PspA and compositions containing PspA. 

As discussed above and in the following Examples, the 
invention relates to truncated PspA, e.g., PspA C-terminal to 
position 192 such as a. a. 192-588 ("BCioo") 192-299 and 192-260 
of PspA eliciting cross-protection, as well as to DNA encoding 
such truncated PspA (which amplify the coding for these amino 
acid regions homologous to most PspAs) . 

The invention further relates to a pspA-like gene, or a 
pspC gene and portions thereof (e.g., probes, primers) which can 
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hybridize thereto and/or amplify that gene, as well as to DNA 
molecules which hybridize to pspA, so that one can, by 
hybridization assay and/or amplification, ascertain the presence 
of a particular pneumococcal strain; and, the invention provides 
that a PspC can be produced by the pspA-like or pspC sequence 
(which PspC can be used like PspA) . 

Indeed, the invention further relates to 
oligonucleotide probes and/or primers which react with pspA 
and/or pspC of many, if not all, strains, so as to permit 
identification, detection or diagnosis of any pneumococcal 
strain, as well as to expression products of such probes and/or 
primers, which can provide cross-reactive epitopes of interest. 

The repeat region of pspA and/ or pspC is highly 
conserved such that the present invention provides 
oligonucleotide probes or primers to this region reactive with 
most, if not all strains, thereby providing diagnostic assays and 
a means for identifying epitopes of interest. 

The invention demonstrates that the pspC gene is 
homologous to the pspA gene in the leader sequence, first portion 
of the proline-rich region and in the repeat region; but, these 
genes differ in the second portion of their proline-rich regions 
and at the very 3' end of the gene encoding the 17 amino acid 
tail of PspA. The product of the pspC gene is expected to lack a 
C-terminal tail, suggesting different anchoring than PspA. Drug 
interference with functions such as surface binding of the coding 

17 



wo 97/09994 



PCTAJS96/14819 



for repeat regions of pspA and the pspC genes, or with the repeat 
regions of the expression products, is therefore a target for 
intervention of pneumococcal infection. 

Further still, the invention provides evidence of 
additional pspA homologous sequences, in addition to pspA and the 
pspC sequence. The invention, as mentioned above, includes 
oligonucleotide probes or primers which distinguish between pspA 
and the pspC sequence, e.g., lsmi and LSM2, useful for diagnostic 
detecting, or isolating purposes; and LSMI and LSMIO or LSMl and 
LSM7 which amplify a portion of the pspC gene, particularly the 
portion of that gene which encodes an antigenic, immunological or 
protective protein. 

The invention further relates to a method for the 
isolation of native PspA by growth of pneumococci medium 
containing high concentrations of (about 0.9% to about 1.4%, 
preferably 1.2%) choline chloride, elation of live pneumococci 
with a salt solution containing choline chloride, e.g., about 1% 
about 3%, preferably 2% choline chloride, and growth of 
pneumococci in medium in which the choline in the medium has been 
almost or substantially completely replaced with a lower 
alkanolamine, e.g., C^-C^, preferably Cj alkanolamine, i.e., 
preferably alkanolamine, i.e., preferably ethanolamine (e.g., 
0.0000005% to 0.0000015%, preferably 0.000001% choline chloride 
plus 0.02% to 0.04% alkanolamine (ethanolamine), preferably 
0.03%). PspA from such pneumococci is then preferably isolated 
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from a choline-sepharose affinity column, thereby providing 
highly purified PspA. Such isolated and/or purified PspA is 
highly immunogenic and is useful in antigenic, immunological or 
vaccine composition. 

Indeed, the growth media of the pneumococci grown in 
the presence of the alkanolamine (rather than choline) contains 
PspA and is itself highly immunogenic and therefore useful as an 
antigenic, immunological or vaccine composition; and, is rather 
inexpensive to produce. Per microgram of PspA, the PspA in the 
alkanolamine medium is much more protective than PspA isolated by 
other means, e.g., from extracts. Perhaps, without wishing to 
necessarily be bound by any one particular theory, there is a 
synergistic effect upon PspA by the other components present 
prior to isolation, or simply PspA is more protective (more 
antigenic) prior to isolation and/or purification (implying a 
possibility of some loss of activity from the step of isolation 
and/or purification) . 

The invention further relates to the N-terminal 115 
amino acids of PspA, which is useful for compositions comprising 
an epitope of interest, immunological or vaccine compositions, as 
well as the DNA coding therefor, which is useful in preparing 
these N-terminal amino acids by recombination, or for use as 
probes and/or primers for hybridization and/or amplification for 
identification, detection or diagnosis purposes. 



19 



wo 97/09994 



PCTAJS96/14819 



The invention further demonstrates that there is a 
grouping among the pspA RFLP families. This provides a method of 
identifying families of different PspAs based on RFLP pattern of 
pspAs, as well as a means for obtaining diversity of PspAs in an 
antigenic, immunological or vaccine composition; and, a method of 
characterizing clonotypes of PspA based on RFLP patterns of PspA. 
And, the invention thus provides oligonucleotides which permit 
amplification of most, e.g., a majority, if not all of s. 
pneumoniae and thereby permit RFLP analysis of a majority, if not 
all, s. pneumoniae. 

The invention also provides PspC, having an approximate 
molecular weight of 105 kD, with an estimated pi of 6.09, and 
comprising an alpha-helical region, followed by a proline-rich 
domain and repeat region. A major cross-protective region of 
PspA comprises the C-terminal third of the alpha-helical region 
(between residues 192 and 260 of PspA), which region accounts for 
the binding of 4 of 5 cross-protective MAb, and PspA fragments 
comprising this region can elicit cross-protective immunity in 
mice. Homology between PspC and PspA begins at amino acid 148 of 
PspA, thus including the region from 192 to 299, and including 
the entire PspC sequence C-terminal of amino acid 486. Due to 
the substantial sequence homology between PspA and PspC in a 
region comprising the epitopes of interest, known to be 
protection eliciting, PspC is likely to comprise epitopes of 
interest similar to those found in PspA. Antibodies specific for 
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this region of PspA, i.e., between amino acids 148 and 299, 
should cross-react with PspC, and thus afford protection by 
reacting with PspC and PspA. Similarly, immunization with PspC 
would be expected to elicit antibodies cross-protective against 
PspA. 

An epitope of interest is an antigen or immunogen or 
immunologically active fragment thereof from a pathogen or toxin 
of veterinary or human interest. 

The present invention provides an immunogenic, 
immunological or vaccine composition containing the pneumococcal 
epitope of interest, and a pharmaceutically acceptable carrier or 
diluent. An immunological composition containing the 
pneumococcal epitope of interest, elicits an immunological 
response - local or systemic. The response can, but need not be, 
protective. Am immunogenic composition containing the 
pneumococcal epitope of interest, likewise elicits a local or 
systemic immunological response which can, but need not be, 
protective. A vaccine composition elicits a local or systemic 
protective response. Accordingly, the terms "immunological 
composition" and "immunogenic composition" include a "vaccine 
composition" (as the two former terms can be protective 
compositions) . 

The invention therefore also provides a method of 
inducing an immunological response in a host mammal comprising 
administering to the host an immunogenic, immunological or 
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vaccine composition comprising the pneumococcal epitope of 
interest, and a pharmaceutically acceptable carrier or diluent. 

The DNA encoding the pneumococcal epitope of interest 
can be DNA which codes for full length PspA, Pspc, or fragments 
thereof. A sequence which codes for a fragment of PspA or PspC 
can encode that portion of PspA or PspC which contains an epitope 
of interest, such as a protection-eliciting epitope of the 
protein. 

Regions of PspA and PspC have been identified from the 
Rxl strain of S. pneumoniae which not only contain protection- 
eliciting epitopes, but are also sufficiently cross-reactive with 
other PspAs from other S. pneumoniae strains so as to be suitable 
candidates for the region of PspA to be incorporated into a 
vaccine, immunological or immunogenic composition. Epitopic 
regions of PspA include residues l to lis, l to 314, 192 to 260 
and 192 to 588. DNA encoding fragments of PspA can comprise DNA 
which codes for the aforementioned epitopic regions of PspA; or 
it can comprise DNA encoding overlapping fragments of PspA, e.g., 
fragment 192 to 588 includes 192 to 260, and fragment 1 to 314 
includes 1 to 115 and 192 to 260. 

As to epitopes of interest, one skilled in the art can 
determine an epitope of immunodominant region of a peptide or 
polypeptide and ergo the coding DNA therefor from the knowledge 
of the amino acid and corresponding DNA sequences of the peptide 
or polypeptide, as well as from the nature of particular amino 
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acids (e.g., size, charge, etc.) and the codon dictionary, 
without undue experimentation. 

A general method for determining which portions of a 
protein to use in an immunological composition focuses on the 
size and sequence of the antigen of interest. "In general, large 
proteins, because they have more potential determinants are 
better antigens than small ones. The more foreign an antigen, 
that is the less similar to self configurations which induce 
tolerance, the more effective it is in provoking an immune 
response." Ivan Roitt, Essential Immunology . 1988. 

As to size, the skilled artisan can maximize the size 
of the protein encoded by the DNA sequence to be inserted into 
the viral vector (keeping in mind the packaging limitations of 
the vector) . To minimize the DNA inserted while maximizing the 
size of the protein expressed, the DNA sequence can exclude 
introns (regions of a gene which are transcribed but which are 
subsequently excised from the primary RNA transcript) . 

At a minimum, the DNA sequence can code for a peptide 
at least 8 or 9 amino acids long. This is the minimum length 
that a peptide needs to be in order to stimulate a CD4+ T cell 
response (which recognizes virus infected cells or cancerous 
cells) . A minimum peptide length of 13 to 25 amino acids is 
useful to stimulate a CD8+ T cell response (which recognizes 
special antigen presenting cells which have engulfed the 
pathogen) . See Kendrew, supra. However, as these are minimum 
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lengths, these peptides are likely to generate an immunological 
response, i.e., an antibody or T cell response; but, for a 
protective response (as from a vaccine composition) , a longer 
peptide is preferred. 

With respect to the sequence, the DMA sequence 
preferably encodes at least regions of the peptide that generate 
an antibody response or a T cell response. One method to 
determine T and B cell epitopes involves epitope mapping. The 
protein of interest "is fragmented into overlapping peptides with 
proteolytic enzymes. The individual peptides are then tested for 
their ability to bind to an antibody elicited by the native 
protein or to induce T cell or B cell activation. This approach 
has been particularly useful in mapping T-cell epitopes since the 
T cell recognizes short linear peptides complexed with MHC 
molecules. The method is less effective for determining B-cell 
epitopes" since B cell epitopes are often not linear amino acid 
sequence but rather result from the tertiary structure of the 
folded three dimensional protein. Janis Kuby, Immunology, (1992) 
pp. 79-80. 

Another method for determining an epitope of interest 
is to choose the regions of the protein that are hydrophilic. 
Hydrophilic residues are often on the surface of the protein and 
therefore often the regions of the protein which are accessible 
to the antibody. Janis Kuby, Immunology ^ (1992) P. 81. 
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Yet another method for determining an epitope of 
interest is to perform an X-ray cyrstallographic analysis of the 
antigen (full length) -antibody complex. Janis Kuby, Immunology . 
(1992) p. 80, 

Still another method for choosing an epitope of 
interest which can generate a T cell response is to identify from 
the protein sequence potential HLA anchor binding motifs which 
are peptide sequences which are known to be likely to bind to the 
MHC molecule. 

The peptide which is a putative epitope, to generate a 
T cell response, should be presented in a MHC complex. The 
peptide preferably contains appropriate anchor motifs for binding 
to the MHC molecules, and should bind with high enough affinity 
to generate an immune response. Factors which can be considered 
are: the HLA type of the patient (vertebrate, animal or human) 
expected to be immunized, the sequence of the protein, the 
presence of appropriate anchor motifs and the occurance of the 
peptide sequence in other vital cells. 

An immune response is generated, in general, as 
follows: T cells recognize proteins only when the protein has 
been cleaved into smaller peptides and is presented in a complex 
called the "major histocompatability complex MHC" located on 
another cell's surface. There are two classes of MHC complexes - 
class I and class II, and each class is made up of many different 
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alleles. Different patients have different types of MHC complex 
alleles; they are said to have a 'different HLA type'. 

Class I MHC complexes are found on virtually every cell 
and present peptides from proteins produced inside the cell. 
Thus, Class I MHC complexes are useful for killing cells which 
when infected by viruses or which have become cancerous and as 
the result of expression of an oncogene. T cells which have a 
protein called CD4 on their surface, bind to the MHC class I 
cells and secrete lymphokines. The lymphokines stimulate a 
response; cells arrive and kill the viral infected cell. 

Class II MHC complexes are found only on antigen- 
presenting cells and are used to present peptides from 
circulating pathogens which have been endocytosed by the antigen- 
presenting cells. T cells which have a protein called CDS bind 
to the MHC class II cells and kill the cell by exocytosis of 
lytic granules. 

Some guidelines in determining whether a protein is an 
epitopes of interest which will stimulate a T cell response, 
include: Peptide length - the peptide should be at least 8 or 9 
amino acids long to fit into the MHC class I complex and at least 
13-25 amino acids long to fit into a class II MHC complex. This 
length is a minimum for the peptide to bind to the MHC complex. 
It is preferred for the peptides to be longer than these lengths 
because cells may cut the expressed peptides. The peptide should 
contain an appropriate anchor motif which will enable it to bind 
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to the various class I or class II molecules with high enough 
specificity to generate an immune response (See Bocchia, M» et 
al. Specific Binding of Leukemia Oncogene Fusion Protein Peptides 
to HIA Class I Molecules . Blood 85:2680-2684; Englehard, VH, 
Structure of peptides associated with class I and class II MHC 
molecules Ann. Rev. Immunol. 12:181 (1994)). This can be done, 
without undue experimentation, by comparing the sequence of the 
protein of interest with published structures of peptides 
associated with the MHC molecules. Protein epitopes recognized 
by T cell receptors are peptides generated by enzymatic 
degradation of the protein molecule and are prestnted on the cell 
surface in association with class I or class II MHC molecules. 

Further, the skilled artisan can ascertain an epitope 
of interest by comparing the protein sequence with sequences 
listed in the protein data base. Regions of the protein which 
share little or no homology are better choices for being an 
epitope of that protein and are therefore useful in a vaccine or 
immunological composition. Regions which share great homology 
with widely found sequences present in vital cells should be 
avoided. 

Even further, another method is simply to generate or 
express portions of a protein of interest, generate monoclonal 
antibodies to those portions of the protein of interest, and then 
ascertain whether those antibodies inhibit growth in vitro of the 
pathogen from which the from which the protein was derived. The 
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Skilled artisan can use the other guidelines set forth in this 
disclosure and in the art for generating or expressing portions 
of a protein of interest for analysis as to whether antibodies 
thereto inhibit growth in vitro. For example, the skilled 
artisan can generate portions of a protein of interest by: 
selecting 8 to 9 or 13 to 25 amino acid length portions of the 
protein, selecting hydrophilic regions, selecting portions shown 
to bind from X-ray data of the antigen (full length) -antibody 
complex, selecting regions which differ in sequence from other 
proteins, selecting potential HLA anchor binding motifs, or any 
combination of these methods or other methods known in the art. 

Epitopes recognized by antibodies are expressed on the 
surface of a protein. To determine the regions of a protein most 
likely to stimulate an antibody response one skilled in the art 
can preferably perform an epitope map, using the general methods 
described above, or other mapping methods known in the art. 

As can be seen from the foregoing, without undue 
experimentation, from this disclosure and the knowledge in the 
art, the skilled artisan can ascertain the amino acid and 
corresponding DNA sequence of an epitope of interest for 
obtaining a T cell, B cell and/or antibody response. In 
addition, reference is made to Gefter et al., U.S. Patent No. 
5,019,384, issued May 28, 1991, and the documents it cites, 
incorporated herein by reference (Note especially the "Relevant 
Literature" section of this patent, and column 13 of this patent 
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which discloses that: "A large number of epitopes have been 
defined for a wide variety of organisms of interest. Of 
particular interest are those epitopes to which neutralizing 
antibodies are directed. Disclosures of such epitopes are in 
many of the references cited in the Relevant Literature 
section.") 

Further, the invention demonstrates that more than one 
serologically complementary PspA molecule can be in an antigenic, 
immunological or vaccine composition, so as to elicit better 
response, e.g., protection, for instance, against a variety of 
strains of pneumococci; and, the invention provides a system of 
selecting PspAs for a multivalent composition which includes 
cross-protection evaluation so as to provide a maximally 
efficacious composition - 

The determination of the amount of antigen, e.g., PspA 
or truncated portion thereof and optional adjuvant in the 
inventive compositions and the preparation of those compositions 
can be in accordance with standard techniques well known to those 
skilled in the pharmaceutical or veterinary arts. In particular, 
the amount of antigen and adjuvant in the inventive compositions 
and the dosages administered are determined by techniques well 
known to those skilled in the medical or veterinary arts taking 
into consideration such factors as the particular antigen, the 
adjuvant (if present), the age, sex, weight, species and 
condition of the particular patient, and the route of 
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administration. For instance, dosages of particular PspA 
antigens for suitable hosts in which an immunological response is 
desired, can be readily ascertained by those skilled in the art 
from this disclosure (see, e.g., the Examples), as is the amount 
of any adjuvant typically administered therewith. Thus, the 
skilled artisan can readily determine the amount of antigen and 
optional adjuvant in compositions and to be administered in 
methods of the invention. Typically, an adjuvant is commonly 
used as 0.001 to 50 wt% solution in phosphate buffered saline, 
and the antigen is present on the order of micrograms to 
milligrams, such as about 0.0001 to about 5 wt%, preferably about 
0.0001 to about 1 wt%, most preferably about 0.0001 to about 0.05 
wt% (see, e.g., Examples below or in applications cited herein). 

Typically, however, the antigen is present in an amount 
on the order of micrograms to milligrams, or, about 0.001 to 
about 20 wt%, preferably about 0.01 to about 10 wt%, and most 
preferably about 0.05 to about 5 wt% (see, e.g.. Examples below). 

Of course, for any composition to be administered to an 
animal or human, including the components thereof, and for any 
particular method of administration, it is preferred to determine 
therefor: toxicity, such as by determining the lethal dose (LD) 
and LD5Q in a suitable animal model e.g., rodent such as mouse; 
and, the dosage of the composition (s) , concentration of 
components therein and timing of administering the 
composition (s) , which elicit a suitable immunological response, 

30 



wo 97/09994 



PCT/US96/14819 



such as by titrations of sera and analysis thereof for antibodies 
or antigens, e.g., by ELISA and/or RFFIT analysis. Such 
determinations do not require undue experimentation from the 
knowledge of the skilled artisan, this disclosure and the 
documents cited herein. And, the time for sequential 
administrations can be ascertained without undue experimentation. 

Examples of compositions of the invention include 
liquid preparations for orifice, e.g., oral, nasal, anal, 
vaginal, peroral, intragastric, mucosal (e.g., perlingual, 
alveolar, gingival, olfactory or respiratory mucosa) etc., 
administration such as suspensions, syrups or elixirs; and, 
preparations for parenteral, subcutaneous, intradermal, 
intramuscular or intravenous administration (e.g., injectable 
administration), such as sterile suspensions or emulsions. Such 
compositions may be in admixture with a suitable carrier, 
diluent, or excipient such as sterile water, physiological 
saline, glucose or the like. The compositions can also be 
lyophilized. The compositions can contain auxiliary substances 
such as wetting or emulsifying agents, pH buffering agents, 
gelling or viscosity enhancing additives, preservatives, 
flavoring agents, colors, and the like, depending upon the route 
of administration and the preparation desired. Standard texts, 
such as "REMINGTON'S PHARMACEUTICAL SCIENCE", 17th edition, 1985, 
incorporated herein by reference, may be consulted to prepare 
suitable preparations, without undue experimentation. 
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Compositions of the invention, are conveniently 
provided as liquid preparations, e.g., isotonic aqueous 
solutions, suspensions, emulsions or viscous compositions which 
may be buffered to a selected pH. If digestive tract absorption 
is preferred, compositions of the invention can be in the "solid" 
form of pills, tablets, capsules, caplets and the like, including 
"solid" preparations which are time-released or which have a 
liquid filling, e.g., gelatin covered liquid, whereby the gelatin 
is dissolved in the stomach for delivery to the gut. if nasal or 
respiratory (mucosal) administration is desired, compositions may 
be in a form and dispensed by a squeeze spray dispenser, pump 
dispenser or aerosol dispenser. Aerosols are usually under 
pressure by means of a hydrocarbon. Pump dispensers can 
preferably dispense a metered dose or, a dose having a particular 
particle size. 

Compositions of the invention can contain 
pharmaceutically acceptable flavors and/or colors for rendering 
them more appealing, especially if they are administered orally. 
The viscous compositions may be in the form of gels, lotions, 
ointments, creams and the like and will typically contain a 
sufficient amount of a thickening agent so that the viscosity is 
from about 2500 to 6500 cps, although more viscous compositions, 
even up to 10,000 cps. may be employed. Viscous compositions have 
a viscosity preferably of 2500 to 5000 cps, since above that 
range they become more difficult to administer. However, above 
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that range ^ the compositions can approach solid or gelatin forms 
which are then easily administered as a swallowed pill for oral 
ingestion* 

Liquid preparations are normally easier to prepare than 
gels, other viscous compositions, and solid compositions. 
Additionally, liquid compositions are somewhat more convenient to 
administer, especially by injection or orally, to animals, 
children-, particularly small children, and others who may have 
difficulty swallowing a pill, tablet, capsule or the like, or in 
multi-dose situations. Viscous compositions, on the other hand, 
can be formulated within the appropriate viscosity range to 
provide longer contact periods with mucosa, such as the lining of 
the stomach or nasal mucosa. 

Obviously, the choice of suitable carriers and other 
additives will depend on the exact route of administration and 
the nature of the particular dosage form, e.g., liquid dosage 
form [e.g., whether the composition is to be formulated into a 
solution, a suspension, gel or another liquid form], or solid 
dosage form [e.g., whether the composition is to be formulated 
into a pill, tablet, capsule, caplet, time release form or 
liquid-filled form] . 

Solutions, suspensions and gels, normally contain a 
major amount of water (preferably purified water) in addition to 
the antigen, lipoprotein and optional adjuvant. Minor amounts of 
other ingredients such as pH adjusters (e.g., a base such as 
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NaOH) , emulsifiers or dispersing agents, buffering agents, 
preservatives, wetting agents, jelling agents, (e.g., 
methylcellulose) , colors and/or flavors may also be present. The 
compositions can be isotonic, i.e., it can have the same osmotic 
pressure as blood and lacrimal fluid. 

The desired isotonicity of the compositions of this 
invention may be accomplished using sodium chloride, or other 
pharmaceutically acceptable agents such as dextrose, boric acid, 
sodium tartrate, propylene glycol or other inorganic or organic 
solutes. Sodium chloride is preferred particularly for buffers 
containing sodium ions. 

Viscosity of the compositions may be maintained at the 
selected level using a pharmaceutically acceptable thickening 
agent. Methylcellulose is preferred because it is readily and 
economically available and is easy to work with. Other suitable 
thickening agents include, for example, xanthan gum, 
carboxymethyl cellulose, hydroxypropyl cellulose, carbomer, and 
the like. The preferred concentration of the thickener will 
depend upon the agent selected. The important point is to use an 
amount which will achieve the selected viscosity. Viscous 
compositions are normally prepared from solutions by the addition 
of such thickening agents. 

A pharmaceutically acceptable preservative can be 
employed to increase the shelf -life of the compositions. Benzyl 
alcohol may be suitable, although a variety of preservatives 
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including, for example, parabens, thimerosal, chlorobutanol, or 
benzalkonium chloride may also be employed. A suitable 
concentration of the preservative will be from 0.02% to 2% based 
on the total weight although there may be appreciable variation 
depending upon the agent selected* 

Those skilled in the art will recognize that the 
components of the compositions must be selected to be chemically 
inert with respect to the PspA antigen and optional adjuvant, 
This will present no problem to those skilled in chemical and 
pharmaceutical principles, or problems can be readily avoided by 
reference to standard texts or by simple experiments (not 
involving undue experimentation) , from this disclosure and the 
documents cited herein. 

The immunologically effective compositions of this 
invention are prepared by mixing the ingredients following 
generally accepted procedures. For example the selected 
components may be simply mixed in a blender, or other standard 
device to produce a concentrated mixture which may then be 
adjusted to the final concentration and viscosity by the addition 
of water or thickening agent and possibly a buffer to control pH 
or an additional solute to control tonicity. Generally the pH 
may be from about 3 to 7.5. Compositions can be administered in 
dosages and by techniques well known to those skilled in the 
medical and veterinary arts taking into consideration such 
factors as the age, sex, weight, and condition of the particular 
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patient or animal, and the composition form used for 
administration (e.g., solid vs. liquid). Dosages for humans or 
other mammals can be determined without undue experimentation by 
the skilled artisan, from this disclosure, the documents cited 
herein, the Examples below (e.g., from the Examples involving 
mice) • 

Suitable regimes for initial administration and booster 
doses or for sequential administrations also are variable, may 
include an initial administration followed by subsequent 
administrations; but nonetheless, may be ascertained by the 
skilled artisan, from this disclosure, the documents cited 
herein, and the Examples below. 

PGR techniques for amplifying sample DNA for diagnostic 
detection or assay methods are known from the art cited herein 
and the documents cited herein (see Examples), as are 
hybridization techniques for such methods. And, without undue 
experimentation, the skilled artisan can use gene products and 
antibodies therefrom in diagnostic, detection or assay methods by 
procedures known in the art. 

The following Examples are provided for illustration 
and are not to be considered a limitation of the invention. 
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EXAMPLES 

EXAMPLE 1 - Truncated Streptococcus pneimoniae PspA Molecules 
Elicit Cross-Protective Immunity Against 
Pneumococcal Challenge 

Since the isolation of S. pnBumoniae from human saliva 
in 1881 and its subsequent connection with lobar pneumonia two 
years later, human disease resulting from pneumococcal infection 
has been associated with a significant degree of morbidity and 
mortality, A recent survey of urgently needed vaccines in the 
developing and developed world places an improved pneumococcal 
vaccine among the top three vaccine priorities of industrialized 
countries* The currently licensed vaccine is a 23-valent 
composition of pneumococcal capsular polysaccharides that is only 
about 60% effective in the elderly and due to poor efficacy is 
not recommended for use in children below two years of age. 
Furthermore the growing frequency of multi-drug resistant strains 
of S. pneumoniae being isolated accentuates the need for a more 
effective vaccine to prevent pneumococcal infections. 

The immunogenic nature of proteins makes them prime 
targets for new vaccine strategies. Pneumococcal molecules being 
investigated as potential protein vaccine candidates include 
pneumolysis, neuraminidase, autolysin and PspA. All of these 
proteins are capable of eliciting immunity in mice resulting in 
extension of life and protection against death with challenge 
doses near the LDcq. PspA is unique among these macromolecules 
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in that is can elicit antibodies in animals that protect against 

inoculums lOO-fold greater than the LDgQ. 

PspA is a surface-exposed protein with an apparent 
molecular weight of 67-99 kDa that is expressed by all clinically 
relevant S. pneumoniae strains examined to date. Though PspAs 
from different pneumococcal strains are serologically variable, 
many PspA antibodies exhibit cross-reactivities with PspAs from 
unrelated strains. Upon active immunization with PspA, mice 
generate PspA antibodies that protect against subsequent 
Challenge with diverse strains of S. pneumoniae. The immunogenic 
and protection-eliciting properties of PspA suggest that it may 
be a good candidate molecule for a protein-based pneumococcal 
vaccine. 

Four distinct domains of PspA have been identified 
based on DNA sequence. They include a N-terminal highly charged 
alpha-helical region, a proline-rich 82 amino acid stretch, a C- 
terminal repeat segment comprised of ten 20-amino acid repeat 
sequences, and a 17-amino acid tail, a panel of MAbs to Rxl PspA 
have been produced and the binding sites of nine of these Mabs 
were recently localized within the Rxl pspA sequence in the 
alpha-helical region. Five of the Rxl Mabs were protective in 
mice infected with a virulent pneumococcal strain, WU2. Four of 
these five protective antibodies were mapped to the distal third 
(amino acids 192-260) of the alpha-helical domain of Rxl PspA. 
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Truncated PspAs containing amino acids 192-588 or 192- 
299, from pneumococcal strain Rxl were cloned and the recombinant 
proteins expressed and evaluated for their ability to elicit 
protection against subseguent challenge with S. pneumoniae WU2. 
As with full-length Rxl PspA, both truncated PspAs containing the 
distal alpha-helical region protected mice against fatal WU2 
pneumococcal infection. However, the recombinant PspA fragment 
extending from amino acid 192 to 588 was more immunogenic than 
the smaller fragment, probably due to its larger size. In 
addition, the protection elicited by the amino acid fragment 192- 
588 of Rxl was comparable to that elicited by full-length Rxl 
PspA* Therefore, cross-protective epitopes of other PspAs were 
also sought in the C-terminal two-thirds of the molecule. As 
discussed below, PspAs homologous to amino acids 192-588 of 
strain Rxl were amplified by PCR, cloned, and expressed in E. 
coll. Then three recombinant PspAs, from capsule type 4 and 5 
strains, were evaluated for their ability to confer cross- 
protection against challenge strains of variant capsular types. 
The data demonstrate that the truncated PspAs from capsular type 
4 and 5 strains collectively protect against or early death 
caused by challenge with capsular type 4 and 5 parental strains 
as well as type 3, 6A, and 6B S. pneumoniae. 

Bacterial strains and culture conditions . All 
pneumococci were from the culture collection of this laboratory, 
and have been described (Yother, J. et al., Infect. Immun. 1982; 
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36: 184-188; Briles, D.E., et al.. Infect. Immun. 1992; 60: lli- 
116; McDaniel, L.S., et al., Microb. Pathog. 1992; 13: 261-269; 
and McDaniel, L.S, et al., in: Ferretti, j.j. et al. , eds. 
G^netjgs of strepto cocci, e n terococci . a nd lantnf-.r.r^r^i 2.995; 283- 
286), with the exception of clinical isolates TJ0893, 0922134 and 
BG8740. Pneumococcal strains TJ0893 and 0922134 were recovered 
from the blood of a 43-year old male and an elderly female, 
respectively, s. pneumoniae BG8743 is a blood isolate from an 8- 
month old infant, strains employed in this study included 
capsular type 3 (A66.3, EF10197, WU2) , type 4 (BG9739, EF3296, 
EF5668, L81905), type 5 (DBL5) , type 6A (DBL6A, EF6796) , type 6B 
(BG7322, BG9163, DBLl) , type 14 TJ0893), type 19 (BG8090) , and 
type 23 (0922134, BG8743) . In addition, strain WG44.1, which 
expresses no detectable PspA, was employed in PspA-specif ic 
antibody analysis. All chemicals were purchased from Fisher 
Scientific, Fair Lawn, New Jersey unless indicate otherwise. 

S. pneumoniae were grown in Todd Hewitt broth (Difco, 
Detroit, Michigan) supplemented with 5% yeast extract (Difco). 
Mid-exponential phase cultures were used for seeding inocula in 
Lactated Ringer's (Abbott laboratories. North Chicago, Illinois) 
for challenge studies. For pneumococcal strains used in 
challenge studies, inocula ranged from 2.8 to 3.8 log^Q CFU 
(verified by dilution plating on blood agar) . Plates were 
incubated overnight in a candle jar at 370c. 



40 



wo 97/09994 



PCr/US96/14819 



E. coll DHl and BL21(DE3) were cultured in LB medium 
(1% Bacto-tryptone (Difco) , 0.5% Bacto Yeast (Difco) , 0.5% NaCl, 
0.1% dextrose). For the preparation of cell lysates, recombinant 
E. coil were grown in minimal E medium supplemented with 0.05 M 
thiamine, 0.2% glucose, 0.1% casamino acids (Difco), and 50 mg/ml 
kanamycin. Permanent bacterial stocks were stored at -80**C in 
growth medium containing 10% glycerol. 

Construction of plasmid-based strains . pET-9a 
(Novagen, Madison, Wisconsin) was used for cloning truncated pspA 
genes from fourteen S. pneumoniae strains: DBL5, DBL6A, WU2, 
BG9739, EF5668, L81905, 0922134, BG8090, BG8743, BG9163, DBLl, 
EF3296, EF6796, and EF10197 (Table 1). pspA gene fragments, from 
fifteen strains, were amplified by PGR using two primers provided 
by Connaught Laboratories, Swiftwater, Pennsylvania Primer N192- 
5'GGAAGGCCATATGCTCAAAGAGATTGATGAGTCT3' and primer C588 - 
5'CCAAGGATCCTTAAACCCATTCACCATTGGC3' were engineered with Ndel and 
BamHI restriction endonuclease sites, respectively. PCR- 
amplified gene products were digested with BamHI and Ndel, and 
ligated to linearized pET-9a digested likewise and further 
treated with bacterial alkaline phosphatase United States Bio- 
chemical Corporation, Cleveland, Ohio) to prevent recirculari- 
zation of the cut plasmid. Clones were first established in E. 
coli BL21(DE3) which contained a chromosomal copy of the T7 RNA 
polymerase gene under the control of an inducible lacUVS 
promoter . 
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E. coli DHl cells were transformed by the method of 
Hanahan (Hanahan, D. J. Mol. Biol. 1983; 166: 557-580). stable 
transfonnants were identified by screening on LB-kanamycin 
plates. Plasmid constructs, isolated from each of these strains, 
were electroporated (Electro Cell Manipulator 600, BTX 
Electroporation System, San Diego, California) into E. coli 
BL21(DE3) and their respective strain designations are listed in 
Table 1. The pET-9a vector alone was introduced into E. coli 
BL21(DES) by electroporation to yield strain RCT125 (Table 2). 
All plasmid constructs and PCR-amplif ied pspA gene fragments were 
evaluated by agarose gel electrophoresis (with 1 kb DNA ladder, 
Gibco BRL, Gaithersburg, Maryland). Next, Southern analysis was 
performed using LMpspAl, a previously described full-length pspA 
probe (McDaniel. L.S. et al., Microb. Pathog. 1992; 13: 261-269) 
random primed labeled with digoxigenin-ll-dUTP (Genius System, 
Boehringer Mannheim, Indianapolis, Indiana). Hybridization was 
detected with chemi luminescent sheets according to the 
manufacturer's instructions (Schleicher & Schuell, Keene, New 
Hampshire) . 

C^ll fractionation of rt ^combinant E. coli strains . 
Multiple cell fractions from transformed E. coli were evaluated 
for the expression of truncated PspA molecules. Single colonies 
were inoculated into 3 ml LB cultures containing kanamycin and 
grown overnight at 37«>c. Next, an 80 ml LB culture, inoculated 
with 1:100 dilution of the overnight culture, was grown at 37«»c 
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to mid-exponential phase (A^qq of ca. 0.5) and a 1 ml sample was 
harvested and resuspended (uninduced cells) prior to induction 
with isopropylthiogalactoside (IPTG, 0.3 mM final concentration). 
Following 1, 2, and 3 hr of induction, 0.5 ml of cells were 
centrifuges, resuspended, and labeled induced cells. The 
remaining culture was divided into two aliquots, centrifuged 
(4000 X g, 10 min, DuPont Sorvall RC 5B Plus) , and the 
supernatant discarded. One pellet was resuspended in 5 ml of 20 
xnM Tris-HCl ph 7.4 200 mM NaCl, 1 mM (ethylenedinitrilo) - 
tetraacetic acid disodium salt (EDTA) and frozen at -20*C 
overnight. Cells were thawed at 65*C for 30 min, placed on ice, 
and sonicated for vive lO-sec pulses (0.4 relative output ^ Fisher 
Sonic Dismembrator , Dynatech Laboratories, Inc. Chantilly, 
Virginia) . Next, the material was centrifuged (9000 x g, 20 min) 
and the supernatant was designed the crude extract-cytoplasmic 
fraction. The pellet was resuspended in Tris-NaCl-EDTA buffer 
and labeled the insoluble cell well and membrane fraction. The 
other pellet, from the divided induced culture, was resuspended 
in 10 ml of 30 mM Tris-HCl pH 8.0 containing 20% sucrose and 1 mM 
EDTA and incubated at room temperature for 10 min with agitation. 
Cells were then centrifuged, the supernatant removed, and the 
pellet resuspended in 5 mM MgS04 (10 ml, 10 min, shaking 40C 
bath) . This material was centrifuged and the supernatant was 
designated osmotic shock-periplasmic fraction. Cell fractions 
were evaluated by SDS-PAGE and* immunoblot analysis. 
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MAbs to PspA . PspA-specific monoclonal antibodies 
(MAbs) XiR278 and 1A4 were used as previously described (Grain, 
M.J. et al., 1990, Infect. Iramun.; §8: 3293-3299). MAb P50-92D9 
was produced by inmunization with DBL5 PspA. The PspA- 
specificity of MAb P50-92D9 was confirmed by Western Analysis by 
its reactivity with native PspAs from S. pneumoniae DBL5, BG9739, 
EF5668, and L81095 and its failure to recognize the PspA-control 
strain WG44.1. 

SDS-PAGE and imm unoblot analysis , e. coli cell 
fractions containing recombinant PspA proteins and biotinylated 
molecular weight markers (low range, Bio-Rad, Richmond, 
California) were separated by sodium dodecyl sulfate- 
polyacrylamide (10%; Bethesda Research Laboratories, 
Gaithersburg, Maryland) gel electrophoresis (SDS-PAGE) by the 
method of Laemmli (Laemmli, U.K. Nature 1970; 227: 680-685). 
Samples were first boiled for 5 min in sample buffer containing 
60 MM Tris pH 6.8, 1% 2-B-mercaptoethanol (Sigma, St. Louis, 
Missouri), 1% SDS, 10% glycerol, and 0.01% bromophenol blue. 
Gels were subsequently transferred (l hr, lOO volts) to 
nitrocellulose (0.45 mM pores, Millipore, Bedford, Massachusetts) 
as per the method of Towbin et al. Blots were blocked with 3% 
casein, 0.05% Tween 20 in 10. mM Tris, o.i M NaCl, pH 7.4 for 30 
min prior to incubating with PspA-specific monoclonal antibodies 
diluted in PBST for 1 hr at 25°C. Next, the blot was washed 3 
times with PBST before incubating with alkaline phosphatase- 
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labeled goat anti-mouse immunoglobulin (Southern Biotechnology 
Associates, Inc., Birmingham, Alabama) for 1 hr at 25°C. Washes 
were performed as before and blots was developed with 0.5 mg/ml 
5-bromo-4-chloro-3-indolyl phosphate and 0.01% nitro blue 
tetrazolium (Sigma) first dissolved in 150 /xl of dimethyl 
sulfoxide and then diluted in 1.5 M Tris-HCl pH 8,8. Dot blots 
were analyzed similarly. Lysate samples (2 ^1) were spotted on 
nitrocellulose filters (Millipore) , allowed to dry, blocked, and 
detected as just described. 

Preparation of cell lysates containing recombinant PspA 
proteins . Transformed E. coli strains RCT105, RCT113, RCT117, 
and RCT125 (Table 2) were grown in mid-exponential phase in 
minimal E medium before IPTG induction (2 mM final concentration, 
2 hours, 37*»C). Cultures were harvested by centrif ugation (10 
min at 9000 x g) , resuspended in Tris-acetate pH 6.9, and frozen 
at -80**c overnight. Samples were thawed at 65**C for 30 min, 
cooled on ice, and sonicated. Next the samples were treated with 
0.2 mM AEBSF (Calbiochem, La Jolla, California) at 37**C for 30 
min and finally centrifuged to remove cell wall and membrane 
components. Dot blot analysis was performed using PspA-specif ic 
MAbs to validate the presence of recombinant, truncated PspA 
molecules in the lysates prior to their use as immunogens in 
mice. Unused lysate material was stored at -20**C until 
subsequent immunizations were performed. 
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Mouse iminunization and chaliAnrf o CBA/CAHN-XID/J mice 
(Jackson Laboratories, Bar Harbor, Maine) , 6-12 weeks old, were 
employed for protection studies. These mice carry a X-linked 
immunodeficiency that prevents them from generating antibody to 
polysaccharide components, thus making them extremely susceptible 
to pneumococcal infection. Animals were immunized subcutaneously 
with cell lysates from E coli recombinant strains RCT105, RCT113, 
RCT117, and RCT125 (Table 2) in complete Freund's adjuvant for 
primary immunizations. Secondary injections were administered in 
incomplete adjuvant and subsequent boosts in dHjO. Immunized and 
nonimmunized mice (groups of 2 to 5 animals) were challenged with 
S. pneumoniae strains A66.3, BG7322, DBL6A, WU2, DBL5, BG9739, 
and L81905 intravenously (tail vein) to induce pneumococcal 
sepsis. Infected animals were monitored for 21 days and mice 
that survived the 3-week evaluation period were designated 
protected against death and scored as surviving 22 days for 
statistical analysis. Protection that resulted in extension of 
life was calculated as a comparison between mean number of days 
to death for immunized versus pooled control mice (nonimmunized 
and RCT125 sham-immunized; total of 6-7 animals) . 

Determination o f PspA serum levels . Mice were bled 
retro-orbitally following the secondary boost and again prior to 
challenge. Representative mouse titers were evaluated by enzyme- 
linked immunorsorbent assay (ELISA) using native, parental PspAs 
isolated from pneumococcal strains DBL5, BG9739, and L81905. 
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PspAs were immobilized on microtiter plates by incubating in 0.5 
NaHC03, 0.5 M Na2C03pH9.5 at 4°C overnight. Alkaline 
phosphatase-labeled goat anti-mouse immunoglobulin (Southern 
Biotechnology Associates, Inc.) was used to detect mouse serum 
antibodies. Color development was with p-nitrophenyl phosphate 
(Sigma, l mg/ml) in 0.5 m MgCLj pH 9.8 with 10% diethanolamine 
and absorbance was read at 405 nm after a 30 min incubation. 
Reciprocal titers were calculated as the last dilution of 
antibody that registered an optical density value of 0.1. Sera 
from individual mice within a particular immunogen group were 
evaluated separately and then the respective titers from four 
mice per group were combined to obtain titer range (Table 3) . 

Statistics . The one-tailed Fisher exact and two sample 
rank tests were used to evaluate protection against death and 
extension of life in the mouse model. 

Cloning of truncated psvA genes . Using primers N192 
and C588, truncated pspA genes from fifteen diverse pneumococcal 
strains representing eight different capsular types (Table l) 
were amplified by PCR. Even though variability exists in pspA 
genes from different strain, this result demonstrates that 
sufficient conservation exists between variant pspA genes to 
allow sequence amplification in all strains examined to date. 
Successful pspA PCR-amplif ication extended to all capsule types 
evaluated. 
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Fourteen of the amplified pspA genes were cloned and 
three clones containing truncated PspA molecules from 
pnexnnococcal strains DBL5, BG9739, and L81905 were further 
studies (Table 2). To verify the constructions, plasmids from 
recombinant E, coli strains (RCT105, RCT113, RCT117, and RCT125 
(Table 2) were isolated, digested with Ndel and BAMHl restriction 
endonucleases, and electrophoresed in 1% agarose side-by-side 
with the PCR products used in their respective constructions 
(Figure lA) . The digestion reaction was complete for pRCTlOS, 
while PRCT113 and pRCTll? digestions were incomplete (lanes 5 and 
7, respectively). This gel was denatured and DNA transferred to 
nylon for Southern analysis. Figure IB depicts the corresponding 
Southern blot probed with full-length RxlpspA DNA. Lane 1 
contains pRCT125, digested vector alone, which does not react 
with the pneumococcal DNA-specific probe, as expected. The pspA- 
specific probe hybridized with the PCT products and the digested 
plasmid inserts (see arrow. Figure IB) as well as the partially 
undigested pRCTll3 and pRCTll7 (lane 5 and 7), confirming 
successful cloning of DBL5, BG9739, and L81905 pspA DNA. 
Constructions were similarly confirmed with the eleven additional 
recombinant strains containing truncated pspA genes from S. 
pjieuiDoniae strains of different capsular and PspA types. 

Expression of recom binant PspA in E. coli Bl21fDe3K 
Transformed coli strains RCT105, RCT113, RCT117, and RCT125 
were cultured to mid-exponential phase prior to the addition of 
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IPTG to induce expression of the cloned, truncated pspA gene in 
each strain. A cell fractionation experiment was performed to 
identify the location of recombinant PspA proteins in transformed 
E. coli strains. Samples representing uninduced cells, included 
cells (1 hr, 2 hr, and 3 hr time intervals) , the periplasmic 
fraction, the cytoplasmic fraction, and insoluble cell 
wall/membrane material were resolved by SDS-PAGE- Proteins were 
then transformed to nitrocellulose and Western analysis was 
performed using monoclonal antibodies specific for PspA epitopes. 

Figure 2 reveals that both the cytoplasmic (lane 8) and 
the insoluble matter fractions (lane 9) , from recombinant strain 
RCT 105, contain a protein of approximately 53.7 kDa that is 
recognized by MAb XiR278 that is not seen in the uninduced cell 
sample (lane 3) . This protein increases in quantity in direct 
correlation with the length of IPTG induction (lanes 4-6; 1 hr, 2 
hr, and 3 hr respectively) . No truncated RCT105 PspA was found 
in the periplasmics fraction (lane 7), which was expected since 
the pET-9a vector lacks a signal sequence that would be necessary 
for directing proteins to the periplasm. The observed molecular 
weight (ca. 53.5 kDa) is larger than the predicted molecular 
weight for the 1.2 kb DBLSpspA gene product (43.6 kDa; Figure lA, 
lane 4). Like full-length Rxl PspA, the observed and predicted 
molecular weights for truncated PspAs do not agree precisely. In 
addition, immunoblot analysis was performed for recombinant E. 
coli strains RCT113, and RCT117 (using MAbs 1A4 and P50-92D, 
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respectively) and similar results were obtained, while no cell 
fractions from control strain RCT125 were recognized by MAb 
XiR278. 

Evaluating the protective capacity of recombinant, 
truncated PspAs . The truncated PspA proteins from strains 
RCT113, RCT117, and RCT105 were expressed and analyzed for their 
ability to generate cross-protection against a battery of seven 
S. pneujnoiiiae strains. Control mice (non-immunized and RCT125 
sham-immunized) and recombinant PspA- immunized mice were 
challenged with mouse-virulent strains A66.3, BG7322, DBL6A, WU2, 
DBL5, BG9739, and L81905. Table 3 presents the day of death for 
each infected mouse* 

Immunization with truncated PspA from RCT113, RCT117, 
and RCT105 conferred protection against death for all mice 
challenged with capsular type 3 strains (A66.3 and WU2 (Table 3). 
The three truncated PspAs also provided significant protection 
against death with DBL6A, and BG7322 pneumococci (capsular types 
6A and 68, respectively) . In addition, immunization with 
recombinant RCT113 PspA extended days to death in mice challenged 
with strains DBL5, BG9739, and L81905, while RCT117 PspA 
prolonged the lives of mice inoculated with BG9739 pneumococci 
(Table 3) - Truncated BG9739 PspA elicited protection against all 
challenge strains (100%) evaluated in this study, while 
recombinant LB 1905 and DBL5 truncated PspAs conferred protection 
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against death with 71% and 57% of s. pneumoniae challenge 
strains, respectively. 

Anti-PspA antibody titers elicited by the three 
immunogens vary over approximately a 10-fold range (Table 3) . 
The lowest antibody levels were elicited by RCT105 and this 
truncated PspA also elicited protection against the fewest number 
of challenge strains. RCT113 and RCT117 elicited three and nine 
time as much anti-PspA antibody, respectively. As expected, no 
antibody to PspA was detected in nonimmunized mice nor was 
specif ic-PspA antibody measured in mice immunized with the 
vector-only control strain (RCT125) . 

In summary, immunization with RCT113 and RCT117 PspAs 
protected mice against fatal challenge with capsular type 3 and 
6A strains and extended life for mice inoculated with type 4, 5, 
and 6B pneumococci. RCT105 PspA immunization protected against 
fatal infection with capsular type 3 and 6B strains and prolonged 
time to death for type 6A 5. pneumoniae but offered not 
protection against type 4 and 5 strains. These data demonstrate 
that truncated PspAs from capsular type 4 and 5 pneumococci 
collectively protect mice and ergo other hosts, such as humans, 
against or delay death caused by each of the seven challenge 
strains. In general, however, more complete protection was 
observed against strains of capsular type 3, 6A, and 6B than 
against type 4 and 5 S. pneumoniae. 
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PspA has been shown to be a protection-eliciting 
molecule of S. pneumoniae. Immunization with PspA has also been 
shown to be cross-protective, although eliciting more complete 
protection against certain strains than others. Thus, it is 
possible that a broadly protective PspA vaccine might need to 
contain PspAs of more than one pneumococcal strain. The distal 
third of the alpha-helical region of PspA has been identified as 
a major protective region of PspA. Moreover, this region is 
presented in a very antigenic form when expressed with the intact 
C-terminal half of the molecule, in this Example, the ability to 
use truncated PspA proteins homologous to the region of Rxl PspA 
extending from amino acid residue 192 to the C-terminus at 
residue 588 is demonstrated. 

The C-terminal two-thirds of PspA was cloned from 
fourteen strains by PCR amplification of a gene fragment of the 
appropriate size (1.2 kb) which hybridized with full-length Rxl 
pspA. Successful PCR amplification extended to all capsule types 
analyzed. Thus, the C-terminal two-third of PspA may be 
amplified from many, if not all, pneumococcal capsule types with 
Rxl pspA-specific primers. This technique is thus applicable to 
the development of antigenic immunological or vaccine 
compositions containing multiple PspA or fragments thereof. 

Of these clones, three truncated PspA proteins were 
expressed and evaluated in mouse immunization studies to 
determine their ability to cross-protect against challenge with a 
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variety of pneumococcal capsular types. All three recombinant 
PspAs elicited antibody reactive with their respective donor PspA 
and all three elicited protection against pneumococcal infection. 
Of the two truncated PspA proteins that elicited the highest 
antibody responses, 100% and 71% of the challenge strains were 
protected. RCT105 PspA, which elicited the lowest titers of 
PspA-specif ic antibody, yielded protection against 57% of S. 
pneumoniae strains evaluated, with all truncated PspAs, 
significant levels of protection were observed in four of the 
seven challenge strains. In fact, in all instances except for on 
(RCTlOS-immunized mice challenged with strain BG9739) the trend 
was for truncated PspA-immunization to elicit protection against 
pneumococcal challenge. These results demonstrate that truncated 
Rxl PspA (amino acids 192-588) cross-protects mice against fatal 
S. pneumoniae WU2 challenge. More importantly, these data show 
that the homologous regions of diverse PspAs demonstrate 
comparable cross-protective abilities. 

Strains of capsular type 4 and 5 were more difficult to 
protect against than were type 3, 6A and 6B pneumococcal strains. 
Serological differences in PspAs might affect cross-protection in 
some cases. Yet the difficulty in protecting against the type 4 
and 5 strains used herein could not be explained on this basis, 
since the truncated PspA immunogens were cloned from the same 
three type 4 and 5 strains used for challenge. Both PspAs from 
the type 4 strains delayed death caused by one or both type 4 
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Challenge strains but neither could prevent death caused by 
either type 4 pneumococcal strain. Moreover, the truncated PspA 
from the type 5 strain DBL5 elicited protection against death or 
delayed death with strains of capsular types 3, 6A and 6B but 
failed to protect against infection with its donor strain or 
either type 4 challenge strain. 

There may be several reasons why the truncated PspAs 
from capsular type 4 and type 5 strains failed to protect against 
death even with their homologous donor S. pneumoniae strains. 
One possibility is that the type 4 and 5 strains chosen for study 
are especially virulent in the XID mouse model. XID mice fail to 
make antibodies to polysaccharides and are therefore extremely 
susceptible to pneumococcal infection with less than 100 CFU of 
most strains, including those of capsular type 3, 4, 5, 6A, and 
6B. The increased mouse virulence of types 4 and 5 is apparent 
from the fact that in immunologically normal mice these strains 
have lower LD50S and/ or are more consistently fatal than strains 
of capsular types 3, 6A, or 63. 

Another possibility is that epitopes critical to 
protection-eliciting capacity with capsular type 4 and 5 strains 
are not present in the C-terminal two-thirds of PspA (amino acids 
192-588) , the truncated fragments used for immunization. The 
critical epitopes for these strains may be located in the N- 
terminal two thirds of the alpha-helical region of their PspA 
molecules. Finally, it is also possible that PspA may be less 
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exposed on some S. pneumoniae strains than others. Strain Rxl 
PspA amino acid sequence does not contain the cell wall 
attachment motif LPXTGX described by Schneewind et al. found in 
many gram-positive bacteria. Rather, PspA has a novel anchoring 
mechanism that is mediated by choline interactions between 
pneximococcal membrane-associated lipoteichoic acid and the repeat 
region in the C-terminus of the molecule. Electron micrographic 
examination has confirmed the localization of PspA on the 
pneumococcal surface and PspA-specif ic MAb data supports the 
accessibility of surface-exposed PspA. However, it is not known 
whether S. pneumoniae strains differ substantially in the degree 
to which different PspA regions are exposed to the surrounding 
environment- Nor is it known if the quantity of PspA expressed 
on the bacterial cell surface differs widely between strains. 
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Table 1. pspA recombinant strains categorized by pneumococcal 
capsular type. 



Capsular 


Parent 


ResDective 


Type 


Strains 


Recombinant Stramc 


3 


WU2, EF10197 


RCTlll, RCT137 


A 
*k 


xy^yJoy, £r5668 


Rcni3, Rcnis 




L81905,EF3296 


RCTH7, RCn33 


5 


DBL5 


Rcnos 


6A 


DBL6A, EF6796 


RCn09, RCn35 


6B 


BG9163, DBLl 


RCT129, RCn31 


14 


TT0893 


none* 


19 


BG8090 


Rcmi 


23 


0922134, BG8743 


RCT119, RCn23 



* Truncated psjzA amplified recently, not yet cloned 
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Table 2. Description of recombinant strains used in evaluating the 
protection-elid ting capacity of truncated PspAs in mice. 

Recombinant Capsxde Type 

Strain Description of Parent PspA 



RCT 105 BL21(DE3) Ejcoli with pET'9a:DBL5 5 

RCT113 BL21(DE3)E,^withpET-9a:BG9739 4 

RCT 117 BL21PE3) E> coli with pET-9a:L81905 4 

RCT 125 BL21PE3) E. coli with pET-9a (vector only) 
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EXAMPLE 2 - Localization of protection-eliciting epitopes 
and PspA of S. pneumoniae 

This Example, the ability of PspA epitopes on two PspA 
fragments (amino acids 192-588 and 192-299) to elicit cross- 
protection against a panel of diverse pneumococci is 
demonstrated. Also, this Example identifies regions homologous 
to amino acids 192-299 of Rxl in 15 other diverse pneumococcal 
strains. The DNA encoding these regions was then amplified and 
cloned. The recombinant PspA fragments expressed were evaluated 
for their ability to elicit cross-protection against a panel of 
virulent pneumococci. 

Bacterial strains and media conditions > S. pneumoniae 
strains were grown in Todd Hewitt broth with 0.5% yeast extract 
(THY) (both from Difco Laboratories, Detroit, Michigan) at 37«C 
or on blood agar plates containing 3% sheep blood at 37 under 
reduced oxygen tension. E. coli strains were grown in Luria- 
Bertani medium or minimal E medium. Bacteria were stored at - 
80<*C in growth medium supplemented with 10% glycerol. coli 
were transformed by the methods of Hanahan (Hanahan, D. J. Mol. 
Biol. 1983; 166 : 557). Ampicillin (Ap) was used at a 
concentration of 100 Mg/ml for £. Coli. 

Construction of pIN-III-ompA3 and pMAL-based E. Coli 
strains. Recombinant plasmids pBClOO and pBAR416 that express 
and secrete pspA fragments from E. Coli were constructed with 
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pIN-III-oinpA3 as previously described (McDaniel, L.S. et al., 
Microb. Pathog. 1994; 17: 323). 

The pMAL-p2 vector (New England Biolabs, Protein Fusion 
& Purification System, catalog #800) was used for cloning pspA 
gene fragments to amino acids 192-299 from strain Rxl and from 7 
other S. pneumoniae strains: R36A, D39, A66, BF9739, DBL5, 
DBL6A, and LMIOO. Amplification of the pspA gene fragments was 
done by the polymerase chain reaction (PGR) as described 
previously (McDaniel, L.S. et al., Microb. Pathog. 1994; 17: 323) 
using primers 5 ' CCGGATCCGCTCAAAGAGATTGATGAGTCTG3 ' (LSM4] and 
5'CTGAGTCGACTGAGTTTCTGGAGCTGGAGC3' [LMS6] made with BamHI and 
Sail restriction endonuclease sites, respectively. Primers were 
based on the sequence of Rxl PspA. PGR products and the pMAL 
vector were digested with BAMHI and Sail, and ligated together. 
Clones were transformed into E. Coli DH5a by the methods of 
Hanahan. Stable transf ormants were selected on LB plates 
containing lOO^g/ml Ap. These clones were screened on LB plates 
containing 0.1 mM IPTG, 80 /xg/ml X-gal and 100 /ig/ml Ap and 
replica LB plates with 100 ^g/ml Ap according to the 
manufacturer's instructions. The strain designations for these 
constructs are listed in Table 6. Positive clones were evaluated 
for the correct pspA gene fragment by agarose gel electrophoresis 
following plasmid isolation by the methods of Birnboim and Doly 
(Birnboim, H.G. et al., Nucl. Acids Res, 1979, 7: 1513). 
Southern analysis was done as previously described using a full- 
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length pspA probe (McDaniel, L.S. et al., Microb. Pathog. 1994; 
17: 323), randomly primed labeled with digoxigenin-ll-dUTP 

(Genius System, Boehdinger Mannheim, Indianapolis, Indiana) and 
detected by chemi luminescence. 

Expression of re combinant PsdA protein fragments . For 
induction of expression of strains BClOO.and BAR416, bacteria 
were grown to an optical density of approximately 0.6 at 660 nm 
at 37**C in minimal media, and rpTG was added to a final 
concentration of 2 mM. The cells were incubated for an 
additional 2 hours at 37«C, harvested, and the periplasmic 
contents released by osmotic shock. For strains BAR36A, BAR39, 
BAR66, BAR5668, BAR9739, BARL5, BAR6A and BARIOO, bacteria were 
grown and induced as above except LB media + 10 mM. glucose was 
the culture medium. Proteins from these strains were purified 
over an amy lose resin column according to the manufacturer's 
instructions (New England Biolabs, Protein Fusion & Purification 
System, Catalog #800). Briefly, amylose resin was poured into a 
10 mL column and washed with column buffer. The diluted osmotic 
shock extract was loaded at a flow rate of approximately l 
mL/minute. The column was then washed again with coliunn buffer 
and the fusion protein eluted off the column with column buffer 
containing 10 mM maltose. Lysates were stored at -20**c until 
further use. 

Characteriza tion of truncated PspA proteins used for 
immunization > The truncated PspA molecules, controls and 
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molecular weight markers (Bio-Rad, Richmond, CA) were 
electrophoresed in a 10% sodium dodecyl (SDS) - polyacrylamide 
gel and electroblotted onto nitrocellulose. Rabbit polyclonal 
anti-PspA serum and rabbit antimaltose binding protein were used 
as the primary antibodies to probe the blots. 

A direct binding ELISA procedure was used to 
quantitatively confirm reactivities observed by immunoblotting. 
For all protein extracts, osmotic shock preparations were diluted 
to a concentration of 3 nq/ml in phosphate buffered saline (PBS), 
and 100 nl was added to the wells of Immulon 4 microtitration 
plates (Dynatech Laboratories, Inc., Chantilly, VA) . After 
blocking with 1.5% bovine serum albumin in PBS, unf ractionated 
tissue culture supernates of individual MAbs were titered in 
duplicated by three-fold serial dilution through seven wells and 
developed using an alkaline phosphatase-labeled goat anti-mouse 
immunoglobulin secondary antibody (Southern Biotech Associates, 
Birmingham, AL) and alkalinephosphatase substrate (Sigma, St. 
Louis, MO) . The plates were read at 405 nm in a Dynatech plate 
reader after 25 minutes, and the 30% end point was calculated for 
each antibody with each preparation. 

Immunization and Protection Assavs . six to nine week 
old CBA/CAHN-XID/J (CBA/N) mice were obtained from the Jackson 
Laboratory, Bar Harbor, Maine, CBA/N mice carry an X-linked 
immunodeficiency trait, which renders them relatively unable to 
respond to polysaccharide antigens, but they do respond with 
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normal levels of antibodies against protein antigens. Because of 
the absence of antibodies reactive with the phosphocholine 
determinant of C-polysaccharide in their serum, the mice are 
highly susceptible to pneumococcal infection. Mice immunized 
with the BClOO fragment were injected inguinally with antigen 
emulsified in CFA, giving an approximate dose of 3 /xg of protein 
per mouse. Fourteen days later the mice were boosted 
intraperitoneally with 3 iiq of antigen diluted in Ringer's 
lactate without adjuvant. Control mice were immunized following 
the same protocol with diluent and adjuvant, but no antigen. 
Mice immunized with the BAR416 fragment were injected with 0.2 ml 
at two sites in the sublinguinal area with antigen emulsified in 
CFA. The mice were boosted inguinally fourteen days later with 
antigen emulsified in IFA and were boosted a second time fourteen 
days later intraperioneally with 0.2 ml of antigen diluted in 
Ringer's lactate without adjuvant. 

Mice that were immunized with the homologues of Rxl 
BAR416 were immunized as described above. The control animals 
followed the same immunization protocol but received maltose 
binding protein (MBP) diluted 1:1 in CFA for their immunization 
and were also boosted with MBP. 

Serum analysis . Mice were retro-orbital ly bled with a 
75 /il heparinized microhematocrit capillary tube (Fisher 
Scientific) before the first immunization and then once 
approximately 2 hours before challenge with virulent pneumococci. 
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The serum was analyzed for the presence of antibodies to PspA by 
an enzyme-linked immunosorbent assay (ELISA) using native full- 
length R36A PspA as coating antigen as previously described 
(McDaniel, L.S. Microb. Pathog. 1994; 2J.: 323). 

intravenous infection of mi pp . Pneumococcal cultures 
were grown to late log phase in THY. Pneumococci were diluted to 
10* CFU based on the optical density at 420 nm into lactated 
Ringer's solution. Seven days following the last boost injection 
for each group, diluted pneumococci were injected intravenously 
(tail vein) in a volume of 0.2 ml and plated on blood agar plates 
to confirm the numbers of CFU per milliliter. The final 
challenge dose was approximately 50-100 times the LD50 of each 
pneumococcal strain listed in Tables 4-6. The survival of the 
mice was followed for 21 days. Animals remaining alive after 21 
days were considered to have survived the challenge. 

Statistical analysis . Statistical significance of 
differences in days to death was calculated with the Wilcoxon 
two-sample rank test. Statistical significance of survival 
versus death was made using the Fisher exact test, in each case, 
groups Of mice immunized with PspA containing preparations were 
compared to unimmunized controls, or controls immunized with 
preparations lacking PspA. One-tailed, rather than two-tailed, 
calculations were used since immunization with PspA or fragments 
of PspA has never been observed to cause a statistically 
significant decrease in resistance to infection. 



wo 97/09994 



PCT/US96/14819 



Cloning into pMAL vector . Using primers based on the 
sequence of Rxl PspA, LSM4 and LSM6, pspA gene fragments were 
amplified by PGR from fifteen out of fifteen pneumococcal strains 
examined. Seven of the eleven gene fragments were cloned into 
pMAL-*p2 and transformed into E. coli (Table 6) . The correct 
insert for each new clone was verified by agarose gel 
electrophoresis and Southern hybridization analysis. Plasmids 
from recombinant E. coli strains BAR36A, BAR39, BAR66, BAR9739, 
BARL5, BAR6A and BARIOO were isolated, digested with BamHI and 
Sail restriction endonucleases and electrophoresed on a 0.7% TBE 
agarose gel. The gel was then denatured and the DNA transferred 
to a nylon membrane for southern hybridization. The blot was 
probed with full-length Rxl pspA DNA at high stringency 
conditions. The cloning of R36A, D39, A66, BG9739, DBL5, DBL6A 
and LMIOO pspA DNA into pMal-p2 was confirmed by the recognition 
of all BamKl and Sail digested DNA inserts by the Rxl probe. 

Expression and conformation of truncated recombinant 
proteins . The transformed E. coli strains BAR36A, BAR39, BAR66, 
BAR9739, BARL5, BAR6A and BARIOO were grown in LB media 
supplemented with 10 mM glucose and induced with 2 mM IPTG for 
expression of the truncated PspA protein fused with maltose 
binding protein. Transformed E. coli strains BClOO and BAR416, 
which express PspA fragments fused to the OmpA leader sequence in 
the pIN-III-ompA3 vector, were grown in minimal medium and 
induced with 2 mM IPTG for expression. Both vectors, pIN-:III- 
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oinpA3 and pMal-p2, are vectors in which fusion proteins are 
exported to the periplasmic space. Therefore, an osmotic shock 
extract from the pMal-p2 containing bacteria was then run over an 
amy lose column for purification and resolved by SDS-PAGE western 
blotting. The western blot of the protein extracts from BAR36A, 
BAR39, BAR66, BAR9739, BARL5, BAR6A and BARIOO were recognized by 
a rabbit polyclonal antibody made to strain BClOO PspA. The 
apparent of full-length PspA from WU2 is 91.5 kD. The M^. of 
maltose binding protein is 4 2 kD and the expected M*^ for the PspA 
portion of the fusion is 12 kD. All extracts exhibited molecular 
weights that ranged from 54 to 80 kD. This range of molecular 
weights can be attributed to the variability of pspA among 
different pneumococcal strains. An ELISA, with plates coated 
with the various cloned fragments quantitatively confirmed the 
reactivities that were observed in the western blots with all 
protein extracts. 

Protection and cross- nrotection against fatal 
pneumococcal infection elicited bv cloned PspA fragments , CBA/N 
mice were immunized with the truncated PspA fragment encoded by 
pBClOO, which is composed of amino acids 192 to 588 of Rxl PspA, 
and challenged with 13 different S. pneumoniae strains 
representing 7 different capsular types (Table 4) . With all 13 
strains, the immunization resulted in protection from death or an 
extended time to death, with 10 of the strains the difference 
was statistically significant. With strains of capsular types 3, 
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6A, and SB, all immunized mice were protected against death* 
Although there were fewer survivors in the case of capsular types 
2, A, and 5, the immunization with BClOO resulted in significant 
increases in times to death. 

The BClOO immunization studies made it clear that 
epitopes C-terminal to residue 192 could elicit cross-protection. 
The BAR416 fragment, which includes amino acids 192-299, could 
elicit protection from fatal infection with a single challenge 
strain WU2. This Example shows the ability of BAR416 
immunization to protect against the 6 strains that had been best 
protected against by immunization with BClOO. Immunization with 
the BAR416 construct resulted in increased time to death for all 
6 challenge strains examined (Table 5) . BAR416 provided 
significant protection against death with WU2 , A66, BG7322 and 
EF6796 pneumococci (capsular types 3, 3, 6B and 6A respectively). 
It also prolonged the lives of mice challenged with ATCC6303 and 
DBL6A pneumococci (capsular types 3 and 6A respectively) . Serum 
from mice immunized with the BAR416 fragment yielded a geometric 
mean reciprocal anti-PspA ELISA titer to full-length Rxl PspA of 
750. Mice immunized with BClOO had geometric mean reciprocal 
titers of close to 2000, while non-immunized mice had anti-PspA 
titers of <10. 

The above data indicates that the BAR416 fragment from 
Rxl elicits adequate cross-reactive immunity to protect against 
diverse pneumococci and suggests that this region must be 
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serologically conserved among PspAs. This hypothesis was 
confirmed by immunized with recombinant BAR416 homologous regions 
from the 7 different clones and then challenging with strain WU2 
(Table 6). All 7 immunogens elicited significant protection. 
PspA fragments from capsular types 2 and 22 and the rough R36A 
strain elicited complete protection against death with all 
challenged mice. All of the other immunogens were able to extend 
the time to death of all the mice with the median days to death 
being 21 days or >21 days. Serum from mice immunized with the 
BAR416 homologous fragments had anti-PspA reciprocal titers that 
ranged from 260 to 75,800 with an average of 5700 while control 
animals immunized with only maltose binding protein had anti-PspA 
reciprocal titers of <10. 

Antibody reactivities. All of the above immunization 
studies attest to the cross-reactivity of epitopes encoded by 
amino acids from position 192-299, This region includes the C- 
terminal third of the a-helical region and the first amino acids 
of the proline rich region. Other evidence that epitopes within 
this region are cross-reactive among different PspAs comes form 
the cross-reactivity of a panel of nine MAbs all of which were 
made by immunization with Rxl PspA. The epitopes of four of the 
antibodies in the panel reacted with epitopes mapping between 
amino acids 192-260. The epitopes of the other five MAbs in the 
panel map between amino acids 1 and 115 (McDaniel, L.S., et al., 
Microb. Pathog. 1994; 17: 323). Each of these 9 MAbs were tested 
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for its ability to react with 8 different PspAs in addition to 
Rxl* The 5 MAbs whose epitopes were located within the first 115 
amino acids, reacted on average with only 1 other PspA. Three of 
the 5 in fact, did not react with any of the other 8 PspAs. In 
contrast the MAbs whose epitopes map between 192 and 260 amino 
acids each cross-reacted with an average of 4 of the 8 non-Rxl 
PspAs, and all of them reacted with at least two non-Rxl PspAs. 
Thus, based on this limited section of individual epitopes, it 
would appear that epitopes in the region from 192-260 amino acids 
are generally much more cross-reactive than epitopes in the 
region from 1-115 amino acids. 

The BClOO fragment of Rxl PspaA can elicit protection 
against the encapsulated type 3 strain WU2, Although the PspAs 
of the two strains can be distinguished serologically they are 
also cross-reactive (Grain, et al.. Infect, Immun. 1990; 

58 : 3293) • The earlier finding made it clear that epitopes 
cross-protective between Rxl and WU2 PspAs exist* The importance 
of cross-reactions in the region C-terminal to residue 192 is 
demonstrated in this Example where 13 mouse virulent challenge 
strains have been used to elicit detectable protection against 
all of them. The first indication that epitopes C-terminal to 
position 192 might be able to elicit cross-protection came from 
our earlier study where we showed the MAbs Xi64, XiR278, XiR1323, 
and XiR1325, whose epitopes mapped between amino acids 192 and 
260 of strain Rxl PspA, could protect against infection with WU2, 
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Moreover, immunization with PspA fragments from 192-588 and 192- 
299 were able to elicit protection against infection against WU2. 
This Example shows that the BClOO Rxl fragment (192-588) elicits 
significant protection against each of 13 different mouse 
virulent pneumococci, thereby firmly establishing the ability of 
epitopes C-terminal to position 192 to elicit a protective 
response. The observation that a fusion protein containing amino 
acids 192-299 fused C-terminal to maltose binding protein could 
also elicit cross-protection, permits the conclusion that 
epitopes in this 107 amino acid region of PspA are sufficient to 
elicit significant cross-protection against a number of different 
strains. 

Evidence that a comparable region of other PspAs is 
also able to elicit cross-protection cam from the studies where 
sequences homologous to the 192-299 region of Rxl PspA were made 
from 5 other PspAs. All 5 of these fragments elicited 
significant protection against challenge with strain WU2. These 
data provide some suggestion for serologic differences in cross- 
protection^ elicited by the 192-299 region. 

Based on present evidence, without wishing to be bound 
by any one particular theory, it is submitted that the PspAs in 
strains D39, Rxl and R36A are identical. All of the 9 mice 
immunized with the 192-299 fragments from R36A and D39 survived 
challenge with WU2 . Only LMIOO, one of the non-R36A/D39 PspAs, 
protected as high a percentage of mice from WU2 . The difference 
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in survival elicited by the R36A/D39 PspAs and the non-Rxl 
related PspAs was statistically significant. 

The data does indicate however, that all of the 
differences in protection against different strains are not due 
to differences in serologic cross-reactivity. BClOO, which is 
made from Rxl, protected against death in 100% of the mice 
challenged with 7 different strains of S. pneumonia, but only 
delayed death with strain D39, which is thought to have the same 
PspA as strain Rxl. Thus, some of the differences in cross- 
protection are undoubtedly related to factors other than PspA 
cross-reactivity, whether such factors are related to 
differences in virulence of the different strains in the 
hypersuceptible Xid mouse, or differences in requirements for 
epitopes N-terminal to amino acid 192, or difference in the role 
of PspA in different strains is not yet known. 

These results suggest that a vaccine containing only 
the recombinant PspA fragments homologous with Rxl amino acids 
192-299 is effective against pneumococcal infection. Moreover, 
the results demonstrate that utility of PspA a. a. 192-299, a.a. 
192-260 and DNA coding therefor, e.g. primers N192 or 588 
(variants of LSM4 and LSM2) as useful for detecting the presence 
of pneumococciae by detecting presence of that which binds to the 
amino acid or to the DNA, or which is amplified by the DNA, e.g., 
by using that DNA as a hybridization probe, or as a PGR primer, 
or by using the amino acids in antibody-binding kits, assays or 
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tests; and, the results demonstrate that a. a. 192-299 and a. a. 
192-260 can be used to elicit antibodies for use in antibody- 
binding kits assays or tests • 
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labktf Protection of mice by iinmuni2aUon wiih BClOO from Rxl PspA 



Challenge Capsule PspA 
Strain* type type 



D39 
WU2 
ATCC6303 

A66 
EF10197 
EF5668 
EE3296 
LS1905 
BC9739 
DBL5 
BG7322 
EF6796 
DBL6A 



2 
3 
3 
3 
3 
4 
4 
4 
4 
5 
6 

6A 
6A 



25 

1 

7 

13 

16 

12 

20 

23 

26 

33 

24 

1 

19 



BClOOlmmunogen 
» alive % Median day 



/ ^dead Survival 



Controls 



0/5 

4/0 

5/0 

4/0 

5/0 

1 /3 

1/3 

1/4 

0/4 

0/5 

4/0 

4/0 

5/0 



0% 
1(30% 
100^ 
100% 
100% 
25% 
25% 
20% 
0% 
0% 
100% 
100% 
100% 



ajtyc 

5 
>21 
>21 
>21 
>21 
9 
5 
4 
6.5 
5 

>21 
>21 
>21 



alive % Median days 
/^dead Surv ival alive 



0/3 


0% 


2 


0/3 


0% 


3 


0/5 


0% 


7 


0/3 


0% 


1 


0/3 


0% 


2 


0/3 


0% 


4 


0/3 


0% 


3 


0/6 


0% 


2 


0/3 


0% 


2 


0/3' 


0% 


2 


1/2 


33 J % 


6 


0/3 


0% 


1 


0/3 


0% 


7 



p 

Values 

0^22 
a002 
a004 
0.03 
0J02 

N.S. 

N.S. 

0.02 

N.S. 

0.02 

0i)3 

0*03 

Oi}3 



5p\;Jl7™7 wiLu approximaiciy 1(P CFU/mL of each strain 

»P^values were based on comparison of days alive by a one-tailed V^coxon 2 sample-rank 



lahl 



Challenge Capsiile 
Strain type 


PspA 
type 


I BAR416Imznunogen 


Controls 


P 


w alive 
1 /;rdead 


% 

Survival 


Median days 
alive 


w alive 
/^dead 


% 

Survival 


Median days 
alive 


Value § 


VVU2 


3 


1 


4/1 


80% 


>21 


0/3 


0% 


1 


0.002 


ATCC6303 


3 . 


7 


2/3 


40% 


13 - 


1/4 


20% 


4 


0.018 


A6d 


3 


13 


5/0 


100% 


>21 


0/5 


0% 


2 


0.004 


BG7322 


6 


24 


3/2 


60% 


>21 


0/4 


0% 


7 


om 


EF6796 


6A 


1 


3/2 


60% 


>21 


0/5 


0% 


5 


0.004 
0.008 


DBL6A 


6A 


19 


0/5 

1 ?^t^ • 


0% 


7 


0/5 


0% 


2 



SPw^n-T ™..— wiui aoouc lu-' CFU of each strain 

*P values were based on comparison of days alive by a one-tailed Wilcoxon 2 sample-rank 
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Table Protection of mice against S 
BAR416 Analogs of 7 PspAs 




•Rvalues were based on comparison of days aUve by a one-tailed WUcoxoa 2 saznple-raic 

j~ products amplified 

LSM4 and LSM6 J^stdZo^^^Zl^ ^! reactionf with 

L81905, EF6796, EFSeS^TcTSf^^tV^ST*' s«e W strains BG9163, WU? 



lahU^ Reactivity of MAbs with PspAs of Different FneumornrH 
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f^A. ND. ».a» no. d«.rndn=d. Mapping cf epitopes v«3 .o 
fra^ents of strain Rxl PspA 
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EXAMPLE 3 - Isolation of PspA and Truncated Forms 
Thereof, and Immunization Thereby 

PspA is attached to the pneumococcal surface through a choline binding site on PspA. 
This allows for successful procedures for the isolation of FL-PspA. PspA can be released from 
the surface of pneumococci by elution with 2 percent choline chloride (CC), or by growth in a 
chemically defined medium (CDM) containing 12 percent CC (CDM-CC) or medium in which 
the choline had been replaced by etbanolamine (CDM-ET). Since CDM-ET supematants lack 
high concentrations of choline, the P^A released into them can be adsorbed to a choline (or 
choline analog) column and isolated by elution from the column with 2 percent choline chloride 
(CQ. 

This Example describes the ability to obtain PspA by these procedures, and the abUity of 

t 

PspA obtained by these procedures to elicit protection in mice against otherwise fatal 
pneumococcal sepsis. Native PspA from strains R36A, RXl, and WU2 was used because these 
strains have been used previously in studies of the ability of PspA to elicit protective immunity 
(jec, e.g.. Examples infra and supra) . The first MAbs to PspA were made against PspA from 
strain R36A and the first doned fragments of PspA and PspA mutants came from strain Rxl. 
Strain Rxl was derived from strain R36A, which was in turn derived from the encapsulated type 
2 strain, D39. PspAs from these three strains appears to be indentical based on serologic and 
molecular weight analysis. Molecular studies have shown no differences in the pspA genes of 
strains D39, Rxl, and R36A. The third strain that provided PspA in this Example is the mouse 
virulent capsular type 3 strain WUZ Its PspA is highly cross-reactive with that from R36A and 
Rxl, and immunization with Rxl and D39 PspA can protect against otherwise fatal infections 
with strain WU2, 

S* pneumoniae 

Strains of 5. pneumoniae used m this study have been described previously (Tabled I Bacteria were 
grown in cither Todd-Hcwiit broth with 0.5 percent ycisi extract (THY), or a chenucaliy defined medium 
(CDM) described previously Serial passage of stock cultures was avoided. Strains were maintained 

frozen in THY + 20 percent glycerol and cultured from a scraping of the frozen cultiu^. 
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Recovery ofPspA from pneumococci 

PspA is not found in the medium of growing pneumococci unless they have reached stationary phase 
and autolysis has commenced To release PspA from pneumococci three procedures were used. In one 
approach were grow pneumococci in 100 ml of THY and collect the cells by centrifugation at mid-log phase. 
The pellet was washed three limes in lactated Ringer's solution (Abbot Lab. North Chicago, IL). suspended in a 
small volume of 2 percent choline chloride in phosphate buffered saline (PBS) (pH 7.0)» incubated for 10 
minutes at room temperature, and centrifuged to remove the whole pneumococci. From imraunoblots with anti- 
PspA MAb XiI26 at serial dilutions of the original culture* the suspended pellet, and the supernatant, it was 
evident that this procedure released about half of the PspA originally present on the pneumococci. Analysis of 
silver stained polyacrylamide gels showed this supernatant to contain proteins in addition to PspA 

The CDM used in the remaining two procedures was modified from that of Van der Rijn For normal 
growth it contained 0.03% CC. To cause PspA to be released during bacterial growth, the pneumococci were 
grown in CDM containing 1.2 percent choline chloride (CDM-CQ, or in CDM containing 0.03 percent 
cthanolamine and only 0.000,001 percent choline (CDM-ET). In media lacking a normal concentnition-of 
choline the F-antigen and C-polysaccharide contain phosphoethanolamine rather than phosphocholine In 
CDM-CC and CDM-ET, PspA is released from the pneumococcal surface because of its inability to bind to the 
cholines in the lipoteichoic acids In addition to releasing PspA from the pneumococcal surface, growth in 
CDM-CC or CDM-ET facilitates PspA isolation by its other effects on the cell wall. In these media 
pneumococci do not autolyse thus permitting them to be grown into stationary phase to maximize the yield 
of PspA. In these media scptation does not occur and the pneumococci grow in long chains As the 

pncmococci reach stauonaor phase Aeydi.. cease nuWng PspA, and n.pi^^ Preliminaiy studies, 

using serial dilution dot blots to quantitatc imiicated that the production of PspA ceases at about the time 
the pneumococci begin to settle out. with the fonnation of visible strands of the condensed pneumococcal 
chains. When the pneumococci began to setde out. the medium was recovered by centrifugation at 2900 x g 
for20 minutes, and filtered with a low protein-binding filter (.45m Nalgcne Tissue Culture Filter #158-0045). 

For growth in CDM-CC or CDM-ET. the pneumococci were first adapted to the defined medium and 
then, in the case of CDM-ET. to very low choline concentrations. To do this, strains were first inoculated into 1 
part of THY and 9 parts of CDM medium containing 0.03 percent choline and 0.03 percent ethanolamine. 
After two subsequent subcultures in CDM containing 0.03 percent choline and 0.03 percent ethanolamine (O.I 
ml of culture + 0.9 ml of prc-wamied fresh medium), the culture was used to inoculate CDM with only 0.003 
percent choline (and 0.03 percent ethanolamine). TT.ese steps was repeated until the strain would grow in 
CDM-ET containing 0.000.001 percent choline and 0.03 percent ethanolamine. It was critical that cultures be 
passed while in exponential growth phase (at about 10^ CFU/ml). Even trace contamination of the medium by 
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exogenous choline resulted in the failure of the PspA to be released from the pneumococcal surface ^. Thus, 
disposable plastic ware was used for the preparation of CDM-ET media and for growth of cultures. Once a 
strain was adapted to CDM-ET it was frozen in 80 percent CDM-ET and 20 percent glycerol at -80» C. When 
grown subsequently the strain was inoculated directly into the CDM-ET. 

Isolation of native (fbll-Iength) PspA 

PspA was isolated fiom the medium of cells grown in CDM-ET using choline-Sepharose prepared by 
conjugating choline to epoxy-activated Sepharose 50. ^ separate column was used for media from different 
strains to avoid cross<ontamination of their different PspAs. For isolation of PspA from clarified CDM-ET. wc 
used a 0.6ml bed volume of choline-Sepharose. The column bed was about OJ cm high and 1 .4 cm in diameter. 
The flow rate during loading and washing was approximately 3 mVmin. After loading 300 ml CDM-EI 
supernatant, the column was washed 10 Umes with 3 ml volumes of 50 mM Tris acetate buffer. pH 6.9 
containing 0.25 M NaQ (TAB). The washed column was eluted with sequential 3 ml volumes of 2 percent CC 
in TAB. Protein eluted from the column Was measured (Bio-Rad protein assay. Bro-Rad. Hercules. CA). The 
column was monitored by quantitative dot blot. The loading material, washes, and the eluted material were dot 
blotted (1^) as undiluted. 1/4. 1/16. 1/64. 1/256. and 1/1024 on nitrocellulose. The membranes were then 
blocked with 1 percent BSA in PBS. incubated for 1 hr with PspA-specific MAbs Xil26 or XiR278. and 
developed with biotinylated goat-anU-mouse Ig. alkaline phosphatase conjugated streptavidin (Southern 
Biotechnology Associates Inc. Birmingham. AL). and nitrobluetetrazolium substrate with 5-bromo 4-chloio-3- 
indoyl phosphate ;,-toluidine salt (F«her Scientific. Norcross GA) 17 -^^ p^^ty of eluted PspA was assessed 



by silver-stained (silver stain kit. Bio Rad. Hercules, CA) SDS-PAGE gels run as described previously 
Immunoblots of SDS-PAGE gels were developed with MAbs Xil26 and XiR278 



32 



Isolatioa of 29 kDa PspA 

The 29 kDa fragment comprising the N-terminaf 260 amino acids of PspA was produced in DHI £1 colt 
from pJY4306 An overnight culnirc of JY4306 was grown in 100 ml of Luna Broth (LB) containing 

SO^g/ml ampicillin. The culture was grown at 37<* C in a shalcer at 225 rpm. This culture was used to inoculate 
6 one liter cultures that were grown under the same conditions. When the culture O.D. at 600 nm reached 0.7, 
12 grams of cells, as a wet paste, were harvested at 4° C at 12,000 xg. The pellet was washed in 10 volumes of 
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25 mM Tru pH 7.7 at 0^ C and suspended in 600 ml of 20% sucrose, 25 mM Tris pH 7.7 with 1 0 mM 
cihylenediamine tctraacctic acid (EDTA) for 10 minuics. The cclb were pelleted by centrifugation (8000 xg) 
and rapidly suspended in 900 ml of 1 percent sucrose with 1 mM Pefabloc SC hydrochloride (Bcehringer 
Mannheim Coq>., Indianapolis, IN.) at 0<» C. The suspension was pelleted at 8000 xg at 4o C for 15 minutes and 
the PspA-containing supernatant (periplasmic cxtraa) recovered. The recombinant PspA was precipitated 
from the periplasmic extract by 70 percent saturated ammonium sulfate overnight at 4^ C The precipitated 
material was collected by centrifugation at 12.000 xg at 4<» C for 30 minutes. The precipitated protein was 
rcsuspended in 35 ml of 20 mM histidinc 1 percent sucrose at pH 6.6 (HSB). Insoluble materials were removed 
at KOOO xg at 4° C for 10 minutes. The clarified material was dialyzed versus HSB, passed through a 0.2jim 
filter and further purified on a 1 ml MonoQ HR 5/5 column (Pharmacia Biotech, Inc., Piscaiaway, N J) 
equilibrated with HSB. The clarified material was loaded on the column at 1 ml/min, and the column was 
washed with 10 column volumes of HSB. The column was then eluted with a gradient change to 5 mM NaCl 
per minute at a flow rate of I ml/min. As detected by immuno blot with Xil26, SDS-PAGE and absorbance, 
PspA eluled as a single peak at approximately 0^7 to 030 M NaCl. By SDS-PAGE the material was 
approximately 90 percent pure. The yield from 6 liters of culture was 2 mg (Bio-Rad protein assay) of 
recombinant PspA. 

Growth of pneamococd for challenge 

Mice were challenged with log-phase pneuraococci grown in THY. For challenge* the pneumococci 
were diluted directly into lactaied Ringer's without prior washing or centrifugation. To inject the desired 
numbers of pneumococci, their concentration in lactated Ringer's solution was adjusted to an O.D. of about 0.2 
at 420 nM (LKB Ultrospec III spectrophotometer). The number of pneumococci present was calculated at 
5xl08 CPU per ml / O* D. and confirmed by colony counts (on blood agar) of serial dilutions of the inoculum. 

Immtinization» challenger and bleeding of mice 

CB A/CAHN/XID/J (CBA/N) and BALB/cByJ (BALB/c) mice were purchased from Jackson Laboratory 
Bar Harbor, ME. Mice were given two injections two weeks apart and challenged i.v. two weeks later. 
Injections without CPA were given intrapcrtioneally in a 0.1 ml of Ringers. Where indicated, the first injection 
was given in complete Fretmd's adjuvant (CPA) consisting of approximately a 1:1 emulsion of antigen solution 
and CPA oil (Difco, Detroit MI). Antigen in CPA was injected inguinally in 0.2 ml divided between the two 
hind legs. All mice were boosted i.p. without adjuvant. When mice were injected with media supematants or 2 
percent choline chloride eluates of whole bacteria, the amounts of material injected were expressed as the 
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volume of media from which the injected material was derived. For example, if the clarified medium from 
pncumococci grown in CDM-CC or CDM-ET was used for immunization without diluuon or concentration, the 
dose was described as lOOjiI. If the material was first diluted 1/10, or concentrated 10 fold, the dose was 
referred to as 10 or 1000 ^1 respectively. 

EUSA for antibodies to PspA 

Specific modifications of previously reported ELBA conditions 17 arc described. Microtitration plates 
(Nunc Maxisoip, RG.C Scientific. Gailhersbufg MD.) were coated with undiluted supematants of Rxl and 
WG44.1 pncumococci grown in CDM-ET or I percent BSA in PBS. Mice were bled reiroK)rbitally (75 >il) in a 
hepanmizcd capillary tube (Fisher Scientific. Fair Lawn, NJ.) TTic blood was immediately diluted in 0 J ml of 
one percent bovine serum albumin in PBS. The dilution of the resultant sera was 1/15 based on an average 
hematocrit of 47 percent The sera were diluted in 7 three fold diluUon in microtitration wells starting at 1/45. 
Mab Xil26 was used as a positive control. The maximum reproducible O. D. observed with Xil26 was defined 
as -maximum O. D," The O. D. observed in the absence of immune sera or MAb was defined as "minimum O J).- 
Antibody titers were defined as the dilution that gives 33 percent of maximum O, D. The binding to the Rxl 
CDM-ET coated plates was shown to be PspA-specific, since in no case did we observe ^3 percent of 
maximum binding of immune sera or Xil26 on plates coated with WG44.1 CDM-ET or BSA. 

Statistical analysis. Unless odicrwise indicated P values refer to comparisons using the Wilcoxin two-sample 
rank test to compare the numbers of days to death in different groups. Mice alive at 21 days were assigned a 
value of 22 for the sake of calculation. P values of >0.05 have been regarded as not significant. Since we have 
never observed immunixation with PspA or other antigens to make pncumococci more susceptible to infection 
the P values have been calculated as single tailed tests. To determine what the P value would have been if a 
two tailed test had been used the values given should be multiplied by two. In some cases P values were given 

for comparisons of alive versus dead. These were always calculated using the Fisher exact test. All statistical 
calculations were carried out on a Macintosh computer using InStat (San Diego, CA). 
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FspA is the major protection-eiidting coinponent released from paeumococd grown in CDM-ET or 
CDM^CC, or released from conventionally grown pncumococci by clution with 2% CC. 

PspA*conEaining preparadoos from pncumococci were able to protect mice from fatal sepsis following 
i.v. challenge with 3x10^ (100 x LO50} capsular type 3 S. pneumoniae (Table9 J. Comparable preparations 
from the strains unable to make PspA (WG44. 1 and lYll 19), or unable to make full length PspA (LM34 and 
JY2141) were unable to elicit protection. Regardless of the method of isolation the minimum protective dose 
was derived from pneumococci grown in from 10 - 30 of medium. We also obsetved 9 that supematants of 
log phase pncumococci grown in normal THY or CDM media could not elicit protection (data not shown). This 
finding is consistent with earlier studies indicating the PspA is not normally released in quantity into the 
medium of growing pneumococci. 

Isolated PspA can eUdt protection against fatal infection 

Although PspA was necessary for these preparations to elicit protection it was possible that it did not ac; 
alone. Mice were thus, immtmized with purified FL-PspA to address this question. 

Isolation o/FL-PspAfivm CDM-ET growth medium. We isolated the FL-PspA from CDM-ET rather 
dian from CDM-CC medium or a 2 percent choline chloride clution of live ceils, because the high levels of 
choline present in the latter solutions prevents adsorption of the PspA to the choline residues on the choline- 
Sepharose column. PspA for immtmization was isolated from strain R36A. as the strain is non-encapsulated and 
the isolated PspA could not be contaminated with capsular polysaccharide. As a control we have conducted 
mock isolations from WG44.1 since this strain has an inactivated pspA gene and produces no PspA. The results 
shown in Tci^/fiiO^ire typical of isolations from 300 ml of CDM-ET medium from R36A grown pncumococci. 
We isolated 84 jtg of PspA from 300 ml of medium* or about 280 ^tter. fiased on the dot blot results this 
appears to be about 75% of the PspA in the original medium; and that CDM-ET from R36A cultures contains 
about 400 fig/liier of PspA, o; about 0.4 jig/ml 

No serologically detectablc-PspA was seen in the CDM-ET from WG44.I cultures. More significandy 
there was undetectable protein recovered from the choIinc-Sepharose column after adsorption of CDM-ET from 
a WG44,l culture* indicating that PspA is the only protein that could be isolated by this procedure. Moreover 
by silver stained SDS PAGE gel the PspA isolated from R36A appeared to be homogenous (Figure,3l 
Although autolysin can also be isolated on choline-Sepharose we did not expect it to be isolated by this 

procedure since autolysin is not released from pneumococci grown in choline deficient medium The 
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immunologic purity of the isolated PspA was emphasized by the faa that immunization with it did not elicit any 
antibodies detectable on plates coated with CDM-ET supematants of WG44.1. 

Loading more than 300 ml on the 0.6 ml bed volume column did not result in an increased yield, which 
suggested that the column capacity had been reached. However, increasing the depth of the choline- 
Sepharose bed to greater than OJ cm. decreased the amoom of PspA eluied from the column, presumably 
because of non-specific trapping of aggregates in the column matrix. The elmion buffer contains 50 tnM Tris 
acetate 0.25 M NaQ and 2% choline chloride. Bution without added NaQ or with IM NaQ resulted in lower 
yields. Bution with less than 1% CC also reduced yields. 

Immuniztttien ofmice mth purifted R36A PspA. For immunization we used only the first 3 ml fraction 
of the R36A column. Mice were immunized with two injecUons of 1. 0.1. or O.OI jig of R36A PspA. spaced two 
weeks apart As controls, some mice were inoculated with a comparable dUutions of the first 3 ml fraction from 
the WG44.1 column. Purified R-PspA elicited antibody to PspA at all doses regardless of whether CFA was 
used as an adjuvant aabUHj. In the absence of CFA the highest levels of antibody were seen with the I pg 
dose of PspA. In the presence of CFA. however, the O.l^ig dose was as immunogenic as the 1 ^g dose. 

To test the abiUty of the different doses of PspA to elicit protection agamst challenge we infected the 
immunized mice with two capsular type 3 strains. WU2 and A66. Aldiongh both of these strains are able to kffl 
highly susceptible CBAm XID mice at challenge doses of less than 102. the A66 strain is several logs more 
virulent wdon BALB/c mice are used ''7. 52, -j^ difference in virulence of A66 and WU2. was partially 
compensated for by challenging the immunized CBA/N mice with lower doses of strain A66 than WU2. 

After immunization of CBA/N inice with 1 and 0. 1 fig doses of PspA we observed protecUon against 
WU2 challenge regardless of whether or not CFA was used as an adjuvant (Table 4). At the lowest dose. 0.01 
Hg PspA. most of the mice immunized with PspA + CFA lived whereas most immunized with PspA alone did 
not; however, the difference was not statistically significant. When immunized mice were challenged with the 
more virulent strain A66 53, jmvivors were only observed among mice imrottnized with the 1 and 0.1 |ig 
doses, niere was slightly, more protection against fatal A66 infecUon among mice immunized with CFA than 
without, but the diff(ience was not statistically significanL When the two sample rank test was used to analyze 
the time to death of mice infected with A66 we observed a statistically significant delay in the time to death in 
each immunized group as compared to the pooled controls. 
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The 29 kDa N-terminal fragment of PspA can didt protection against infection when ix\f ected with CPA 

We have compared the immunogcniciiy. with and without CFA» of an isolated 29 kDa fragraem 
composed of the first 260 amino acids of PspA. Uniiice the case with FL-PspA» adjuvant was required for the 29 
IcDa fragment to elicit a protective response. This was observed even though the immunizing doses of the 29 
kDa antigen used were 10 and 30^g/mouse, or about 100 and 300 times the minimum dose of FL-PspA that can 
elicits protection in the absence of adjuvant 

IiUection with CFA revealed the presence of addldonal protection eliciting antigen($) in CDM*CC; and 
CDM-ET growth mediom but not in the 2 percent choline chloride eluates of live cells 

The observation that Freund*s adjuvantcould have such a major effea on the immunogenicity of the 29 
kDa fragment (TabU12)^ prompted us to reexamine the immunogens described in TabU 2 to determine if 
imnnmizauon with adjuvant might enliance protecdon elicited by PspA-containing preparations or provide 
evidence for protection eliciting antigens in addition to PspA. By using CFA with the primary injecuon» the 
dose of PspA-coniaining growth medium (CDM-CC and CDM-ET) required to elicit protection was reduced 
from 10 '20 111 (Tabk9) down to 1 to 3 (TabU13). When CFA was used as an adjuvant with CDM-CC and 
CDM-ET from PspA- strains WG44J and JYl 1 19 we were able to elicit protective immune responses if material 
from ilOOjil or more of media were injected Thus, although there were q^parcntly some protecdon ellcitmg 
con^ncnts other dian PspA in CDC-CC and CDM-ET growth media, PspA remained the major protection 
eliciting component even in the presence of adjuvant 

One of the media used for injection was CDM-ET in which JY2I41 had been grown. This medium 
elicited protection against WU2 challenge even when injected at doses as low as 1 It should be noted that 
although this strain does not make full-length PspA, it secretes a truncated molecule comprising the first 1 15 
amino adds of PspA into the growth medium. Thus, unlike CDM-EI from WG44. 1 and JYl 1 19, CDM-ET from 
JY214 1 has the potential to elicit PspA-spccific immunity. In contrast to these results, the material eluted from 
JY2141 with 2 pcrcem CC was relatively non-immunogenic even when emulsified with CPA. This result is 
consistent widi the faa that the 115 aminoaddN-tcrniinalPspAfragmentofJY2I41 is not surface attached ^7 
and would be expected to be washed away prior to the elulion with 2 percent CC 
Extension of stadics to BALB/c mice and i.p. chaUenge route 

The snidies above all involve Lv. challenge of CBA/N mice expressing with the XID genetic defect The 
i.v. route, used in the present studies provides a relevam model for bacteremia and sepsis, but pncumococci 
have higher LDsqs when injected i.v. than i.p. CBA/N mice are hypcrsusccptible to pneumococcal infecdon 
because of the XID defect This genetic .defect prevents them from having circulating naturally occurring 
antibody to phosphocholinc. The absence of these antibodies has been shown to make XID mice several logs 
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more susceptible to pnctimococci than isogenic mice lacking the immune defect. From the data in TablelAli is 
clear, however, that immunization wirh PspA can protect against infection in mice lacking the XID defect even 
when the challenge is by the Lp. route. Thus, there is no reason to suspect that the results presented arc 
necessarily dependent on the use of the CB A/N XID mouse or the i.v. route. 

PspA Is highly inuDonogenic 

These studies provide the first quantitative data on the amount of purified FL-PspA that is required to 

elicit protective inununity in mice. The isolated PspA for these smdies was obtained by taking advantage of the 

fact that the C-terminal half of PspA binds to cell surface choline The isolated FL-PspA was found to be 

highly immunogenic in the mouse. Only two injections of 100 ng of PspA in the absence of adjuvant were 

required to elicit protection against otherwise fatal sepsis with greater than 100 LD50 of capsular type 3 51 

pneumoniae. When the first injection was given with adjuvant, doses as small as 10 ng could elicit protective 

re^nse. The potent inrniunogenidty of PspA, and the ability to isolate it on choline-Sehparosc 
columns provides a dempnstration for the possible use of PspA as a vaccine in humans. 

A large body of published 29, 37 35 ^.^h ^ unpublished evidence indicates that the major protection 
eliciting epitopes of PspA are located in the a-helical (N-tcrminal) half of the molecule. From the present 
studies, it is clear that immunization with N-tcrminal fragments containing the first 1 15 or 260 of the 288 amino 
add a-helical region arc able to elicit protection when given with CFA, However, these fragment were notable 
to elicit protective responses without CTA. In the case of the both the 1 15 and 260 amino acid fragments, even 
immunization at 100 times the mimmum dose that is immunogenic for FL-PspA faUcd to elicit a protective 
response. TTiis result is consistcm with previous results showing that a fragment composed of the N-terminal 
245 amino acids 3 1. 37 ^ould elicit protection against otherwise fatal pneumococcal infection of mice when the 
immunization was given with CFA^^ ^ that study no immunization without CFA was attempted. Even 
though the C-terrainal half of PspA may not contain major protection-eliciting epitopes it appears to contain 
sequence important in the immunogenicity of the molecule as a whole, since the full length molecule elicited 
much greater protection than the N-tcrminal fragments: TTie effect of die C terminal half on antigenicity may be 
in p?n that it doubles the size of the immunogen. Molecules containing the C-terminal half of PspA may also 
be especially immunogenic because they exhibit more extensive aggregation than Is seen with fragments 
expressing only the a-hclical region 38. Pfotein aggregates arc known to generally be more antigenic and less 
tolerogenic than individual free molecules 
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PspA Is the major protection elidting component of our pneumococcal extracts 

Evidence that PspA is the major protection cliciung component of the CDM-ET, CDM-CC gro\^ih media 
and the two percent CC eluates was dependent on the use of mutant pneumococci that lacked the ability to 
produce FL-PspA« More than one pspA mutant strain was used to Insure that the failure to elicit protection in 
the absence of FL*Psp A was not a spurious result of non-PspA muution blocldng the production of some other 
antigen. Strains WG44.1 and JY1119 contain identical deletions that include the 5* end of the pspA genes and 
extend about 3 kb upstream of pspA^'^. WG44.1 is a mutant of the non-cncapsulated strain Rxl and JYl 119 
was made by transforming c^sular type 3 strain WU2 with the WC44.1 pspA mutation. In no case were 
preparations from WG44. 1 and JYI 1 19 as efficient at eliciting protection as those from the PspA^ strains. To 
role out the possibility that protection elicited by preparations from the PspA**" strains was elicited by some non- 
PspA molecule also encoded by a 3 kb deletion linked to the mutam pspA genes of WG44. 1 and JYl 1 19, we 
also used strains JYllAl and LM34 In these strains the Rxl pspA gene has been insertionally 

inactivated causing the production of N-temunal fragments of 1 15 and 245 amino acids respectively. These 
strains have no other known mutations. Although Rxl and R36A are closely related non-encapsulated strains, 
some of the studies included Rxl as the PspA+ control since it is the isogenic parmer to WG44, 1, LM34, and 
JY2141. The N terminal fragments produced by JY2I41 and LM34 lack the surface anchor and are secreted 
into the medium Two percent CC eluates of JY2141 were non-protection eliciting even in the presence of 
adjuvanL In the absence of adjuvant, CDM-ET from JY2141 was not protection-eliciting. LM34 was tested 
without CPA in only 3 mice, but gave results consistent with those obtained with JY2141. 

Anticapsular antibodies are known to be protective against pneumococcal infection However, in 
these studies it is unlikely that they account for any of the protection we attributed to PspA. Our challenge 
strain bore the type 3 capsular polysaccharide and our primary source of PspA was strain R36A, which is a 
spontaneous non-encapsulated mutant of a capsular type 2 strain 3'' The R36A strain has been recently 
demonstrated to lack detectable type 3 capsule on the surface or in its cytoplasm Fuithermore, the CBA/N 
mice used in most of the studies are unable to make antibody responses to capsular type 3 polysaccharide 

Non-FspA protection eliciting components 

The observation that CDM-CC and CDM-ET supematants of WG44,I could elicit protection when 
injected in large amounts with adjuvant, suggested that these supematants contained at least trace amounts of 
non-PspA protection eliciting molecules. In the case of preparations containing PspA eluted from the surface of 
live washed pneumococci with 2 percent CC, there was no evidence for any protection eliciting components 
other than PspA, presumably because the protection-eliciting non-PspA proteins released into the media were 
removed by the previous washing step. The identity of the protection eliciting molecules in the WG44.1 
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»p=«un. « I" -n"^ » of lnu». *». «.Uk. R36A. R,, to ^ „ 

CDM^ „ CDM.Er. «b p,«™ ^ ^ 57 ^ ^ 

-c. mad. p«^„ „ ^^^^ "'i^l^.^^xmu^^ 

T-d^poria,, (pTOri.) «to »„„ T-indq,e.d.„, (,„|,»ccb.nd.) .^ibod, r=po,^ 
A amtoof „„.p,pAp,„«,-,„ ellddn, p„»„o.«ol pn»da. ta„ b», id«flf«d: p«.mol™„ 
•«ol,«^n»™«d«^^p^„U^„3j^3jj ,„^j,^^_^^^j, j, jj^ 

• p»««ta dicidng compon.^ ^>^^^ «„po«. of. of d«„ .i^^ 

efficacious as PspA have not been successfnl*25 . 
baladonofFspA 

•^^^"'^"ivecapacityoftheCDM-CCCDM-ErandnuuerialdmedftomU 
sunalaru^ce^^ of the voU«,eofU,e original culture £h,a,whichcheinje^ 
adva^gc Of elating the P^Afh^ntthe surface Of p„e«„ococciwith2p«^^^ 
may be pown in „y standard growth mediun. and do not have to be fust adapted to a defined mediun. 
Mo«over. concentration of PspA can be accomplished by centrifugat^^^ 

duuon of the PspA. An advanuge of using either CDM-CC and CDM-ET media was that these media 
prevented lys« and pneumococci could be grown into stationary phase without contaminating the 
P^on, with cytoplanruc contents and membrane and wall components. A particular advantage of CDM- 
ETgrowth medium is that sine, it lacks high concentrations of choUne the PspA contained in it con be 
adsorbed directly to a choHne-Sepharose column for affinity purificaUoa 

One liter of CDM-ET growth medium contains about 400 ftg of PspA. and we were able to isolate about 
3/4ofutoveryhighpurity. At O.^g^ose. a liter of CDM-ET contains enough PspA to immunize about 4.000 



noa- 

non- 
as 
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mice; or possibly 40 - 400 humans. Our present batch size for a single column run is only 300 ml of CDM-ET. 
This could presumably be increased by increasing the amount of the adsorbent surface by increasing the 
diameter of the column. Using our present running buffer we have found that a cholinc-Scpharose resin depth 
of 0 J cm was optimal; increases beyond 0 J cm caused the overall yield to decrease rather than increase, even 
in the presence of larger loading volumes of R36A CDM-ET 
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Table 6 Pneumococcal Strains 
Strain 



Capsule 
type 



PjpA 
expressed 



Parent 
strain 



Construction 
technique 



References 



D39 
R36A 

Rxl 

WG44.1 

LM34 

JY2I41 



WU2 
JY1119 

A66 



non- 
encap- 
sulated 

non* 
encap- 
sulated 

non- 
encap- 
sulated 



encap- 
sulated 



non- 
encap- 
sulated 

3 

3 



full length 
full length 

full length 

none 



clinical isolate 



io.44 



non- aa 1-245 of Rxia 
aal-II5 ofRxia 



full length 
none 

fiiU length 



D39 
R36A 
Rxl 
Rxl 
Rxl 

WU2 



non-encapsulated 23. 44, 45 
mutant 

derived from R36A 26. 39, 41 



aberrant insertion 26, 37 
inactivation with 
pKSD300 

insertional 26, 37, 42 
inactivation with 
pKSD300 

insertional 37 
inactivation with 
pJY4208 
clinical isolate 25, 46 

transformation with 37 
WG44.1DNA 
clinical isolate 44, 47 



aLM34 and 
respectively, 



LY2141 express fragments containing the first 245 and fust 1 15 amino acids of Rxl PspA 



MP 
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Table 9 PspA Is the major protection-elidting 
component in antigen preparations made by three 
different methods 





ouaUl 


J^oseas 


Mcoian 


Alive: 


P versus 


Preparation 


(PspA 


volinne 


Days 


Dead 


controls^ 


Status) 


of media 


Alive 










in Mi* 








2»CC 


R36A 


1000 


>21 


2:0 




chtuefon 




200 


>21 


2:0 




livecellj 


20 


>2I 


2:0 








2 


IJ 


0:2 








III ILXIA 


>21 


o : 2 


0X3 




JY2Ut 


1000 


3,>21 


1:1 






(u 1 - tl5) 


200 


1 


0:2 








20 


I 


0:2 




CDM-CC 


Rxl 


100 - _ 


>21 


9:0 


<0X001 






30 


>21 


2:1 








in 




1:2 








3 


2 


0 - 3 








ALL 


2->21 


11 • K 

i* « Q 






LM34 


too 


2,2.>21 


1:2 






WC44.1 


too 


2 


0:9 








30 


2 


0:3 








10 


2 


0:3 








A 


2 


0 : 3 






WW 


1000 


>21 


3:0 


0.05 




(P*pA+) 


100 


>21 










ALL 


>2l 


4:0 


0X>3 




JYni9 


1000 


4 


0:3 
















CDM-CC 


100 


2 


0:2 






R36A 


100 


>21 


8:0 


<OJ0Q0\ 






10 


3.>2I 


3:5 


0.004 






1 


IJ 


3:5 








0.1 


2 


0:2 








ALL 


>21 


16: 12 


QJ0O6 




mul 


100 


U 


0:2 




(ul-US) 


10 


IJ 


0:2 






WG44.1 


100 


3 


0:2 








10 




0:2 




Nbne 






2 


0: 14 





* Antigen dose is given as the volume of growth media from which the O.I ml of injected material was derived. 
Each mouse was injected twice i.p. with the indicated dose diluted as necessary in lactated Ringer's injcaion 
solution. 

^ Controls used for statistical comparisons: 2% COall JY2141; CDM-CC Rxl. aU WG44.I; CDM-CC WU2, 
jy 1 1 19; CDM-ET. aU WG44- 1 + all JY214L 
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TablelO Isolation of Psp A from 300ral of CDM-ET media after the growth of R36A or WC44. 1 
pneumooood* 



fraction 



R36A 



protein^ml 



total >te 
protein" 



max. 
reciprocal 
dot blotc 



total dot 

blot 
units^* d 



WG44.I 



\ig protein 
pertnl 



total \iR 
protein'^ 



max. 
reciprocal 
dot blotc 



growth media 
fail-through 
1st wash 
10th wash 
elutioa#l 
elQtion#2 
elutioa #3 
total eluted 



133 
13.6 



26 
2 
<1 



3,990 
4,080 



78 
6 

84....- 



4 
I 

<i 
<1 
256 
16 
4 



1200 
300 



770 
48 
12 
830 



13,7 
13J 



<1 
<1 
<1 



4.110 
4,050 



<1 
<l 
<l 
<I 
<1 
<I 
<l 
<I 



* TTic columns were loaded with 300 ml of clarified CDM-ET mediiim after the grow^ 1 
The column was washed with 10 sequential 3 ml fractions of TBA. Hution was with TBA plus 2 percent CC. 

D Total \ig protein or total dot blot units reflect the loul protein in the 3(K) ml of the loading maierial or the 3 ml 
size of the eluted fractions. 

c MAb XiR278 was used in the iimnunoblots to detect PspA ia dot blots. 

<'Dot blot units were calculated as the reciprocal dot blot titer times the volume in ml. 
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Table il Furified full-length PspA is able to elicit protection against fatal sepsis in mice. 



Antigen 


Owe* 


Adjuvant 


Afiti* PspA 
liter* 




ChaUenge with 
10^-i WW 




Challenge with 
10^-2 A66 






or 
DUitenc 


(Log mean 
± SJE,) 


Alive 
Dead 


Niodiao 
Days 
Alive 


VI. 

pooled 
control 


Alive : 


Median 
Oiays 
Alive 


/*n. 
pooled 

oontnls^ 


R36A 


1 MS 
0.1 

0.01 


Riofcr's 
Riflfcr'f 


3 J ± 0.2 
2.6 1 0.2 
2.7 ± 0.2 


5 ; 0 
4:0 


>21 
>2t 
4 


0.015 


2 :3 
1:4 

U . J 


4 

4 

3 


0X02 
0X032 
0.0QS8 




0.1 
0.01 


CFA 
CFA 
CFA 


3J±0^ 
3.6 ± 0.1 
3.1 ± 0.2 


3:0 
5:0 
4: 1 


>21 
>21 
>21 


a027 
0.013 
0.015 


3:2 
2:3 
0:5 


>21 
4 
3 


0X012 
0.0012 
0.0038 


WC44.1 
(ftpA-) 


3600 fU 
360 

36 


Rinfcr's 


<1.6 
<Ij6 




luL 
iLd. 
Bji. 




1 :4 

0:5 
0:5 


3 
2 
2 


OJ. 




3600 Ml 
360 
36 


CFA 
CFA 
CFA 


<1.6 
<1.6 
<1.6 


tLd, 

ILd. 

nA. 


out 
fuL 




0:5 
1:4 
0:5 


2 
2 
2 


n.t. 
a.t. 


saline 




CFA 


<1.6 


1 :5 


4 




fud. 


n.d. 




pooled 
controls 






<1.6 


1:5 


4 




2:28 


2 





* For comparison with the dau in Table 2, it should be noted that the 1, 0. 1, and 0.01 fig doses were derived 
from 3600, 360, and 36 \il of R36A growth media. Equivalent dilutions of the PspA* eluate from strain 
WG44.1 were injected as controls. The amount of the WG44.1 preparations injected is listed as 3600, 360. 
and 36 ^1 and corresponds to the volume original growth medium from which the doses of WG44.1 was 
prepared. 

^Antibody values were expressed as reciprocal ELISA titer. 

values calculated by the Wilcoxon two sample rank test By Kruskal-Wallis nonpararaetric ANO VA for the 
WU2 challenge was significant at />=0.01, for A66 significance was at P<0.0001. 



b4S 
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Table 12 The 29 kDa N-tcnninal fragment of Rxl PspA must be injected 
with adjuvant to elicit protection against WU2^ 

fig 29 kDa Adjuvant Median Days ... ^ . pTSSuT 

PspA ordUuent AUve Alive: Dead *'vei^ 



30 


CFA 


>21 


3:0 




3 


CFA 


>2I 


3:0 


j 0.0006 


30 


Riqgei's 


2 


0:3 




3 


Ringei's 


2 


1:2 




none 


CFA 


2 


0:7 




none 


Rinset's 


2 


0:7 





I," — 7 . m luai xou ammo aaas or f spA. 

^^'nS^^S!^^? °f '•^^O" *^ 30ug and 3 ng data were pooled; mice immunized with PspA + CFA were 
<TA .lont W.S 0^006 1^ b<»» ih« Witeon two SMple rai iBi ihc Fish" 
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Tablel3 FspA is not the only protection eliciting molecule released 
from pneumococd by interference with binding to choline on the 
surface of pneumococd 



PrepanuioQ 


Strain 
(P*pA 
subis) 


Dose 
(as volume 
in Ml) 


Median Alive: 
Day Dead 
Alive 


P values^ 












P vs. aU mi4l 


2ft oc 


lUdA 


1000 


>21 


2:0 




ehiiteftoo 




200 


>21 


5:0 


0.02 


livecdb 


20 


>21 


5:0 


om 








>2I 


5 :0 








JJIIU6A 


>21 


17:0 


ojooi 




JY214I 


1000 


>21 


2:0 






(a« 1 • 115) 


300 


-! 


0:2 








20 


] 


0:2 








2 


I 


0:2 








aumul 


i 


2:6 
















CDM^C 


JUt 


1000 


>21 


3:0 


0.002 






100 


>21 


3 ;0 


0.002 


+ 


W02 


1000 


>2t 


3:0 


0.002 


CFA 


(P«pA*) 


100 


>21 


3 : 0 


ojom 




3 


>2I 


3:0 


0.002 




WC44.1 


1000 


>21 


5:1 


<aoooi 




(P»pA"} 


inA 


2J 


2:4 


0.002 




JYUI9(Pq)A-) 


1000 


>21 


3:0 


0.002 






100 


>21 


3:0 


O.0O2 


CDM-ET 


R36A 


1000 


>21 


3:1 


0.004 


dsifiod 




10 


>21 


4:0 


0.004 






I 


>21 


3:1 


0.0(M 






0^ 


2 


0:4. 






JY2UI 


10 


>21 


2:0 






(aat-115) 


I 


>21 


2:0 






anmui 




>21 


4:0 


OJOM 




W044.1 


100 


>21 


2:0 






<P*pA") 


10 


2 


0:2 




CDM-ETooly 


+ CFA 




2 


0:9 




None 






IJ 


0:4 






Controlt^ 




2 


0:13 





^ In cases where there were not statistically significanfresults no P value was shown. 
*>''Poolcd Controls" refers to "CDM-ET only* Data and "None' data. 
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Tablft 14 Immunization of B ALB/c mice with isolated PspA elicits 
protection again st WU2 S. pneumoniae 

ghallcnge Days to 

LogCFU RoutT' Death 



Anueen 


or dilttcnt 


Soorce 


Dosea 




IMS 


CPA 


WC44.1 (FtpA-) 


loom 


CPA 


KOBB 




CPA 






MM 




lOOpJ 








BOM 


* ftsoiti 







l.p. 
tp. 

Lr. 
Lv. 



2.>21,>2U>21 
2.3 
2.2.2.4 

2.>2U>21.>21 
5.7 
2.2.2.3 



P VS. comrois 
TSR/FEb 



0^)670.03 



OOOtALOOOT 



^^i^teS^iSLt I00pl5inceUusls Ac volume CDM.ETfro';?which 

^tS*^™*?^? ^ ^"inplc rank tea, TSR. or Hsher exact, FE venus pooled controU for 

^sfln^H- controls- include data obtained with by injection of •WG44.1- ai5^none" TT« S 

bS. to« wi« IS r«nlts. When the data from the two studies were pooled theVvJ^ by 

both tests were S0.008. In cases where there were not statistically significant rcsuItTno P value was shown 
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EXAMPLE 4 - Evidence For Siinu1i-;. neous Exnrpssion of Tw» p,:p ^.. 

From Southern blot analysis there has been an issue as 
to whether most isolates of s. pneumoniae has two DNA sequences 
that hybridize with both 5' and 3' halves of Rxl pspA, or whether 
this is an artifact of Southern blot. When bacterial lysates 
have been examined by Western blot, the results have always been 
consistent with the production of a single PspA by each isolate. 
This Example provides evidence for the first time that two PspAs 
of different apparent molecular weights and different serotypes 
can be simultaneously expressed by the same isolate. 

Different PspAs frequently share cross-reactive 
epitopes, and an immune serum to one PspA was able to recognize 
PspAs on all pneumococci. In spite of these similarities, PspAs 
of different strains can generally be distinguished by their 
molecular weights and by their reactivity with a panel of PspA- 
specif ic monoclonal antibodies (MAbs) . 

A serotyping system for PspA has been developed which 
uses a panel of seven MAbs. PspA serotypes are designated based 
on the pattern of positive or negative reactivity in immunoblots 
with this panel of MAbs. Among a panel of 57 independent 
isolates of 9 capsular groups/ types, 31 PspA serotypes were 
observed. The large diversity of PspA was substantiated in a 
subsequent study of 51 capsular serotype 6B isolates from Alaska, 
provided by Alan Parkinson at the Arctic Investigations 
Laboratory of the Centers for Disease Control and Prevention. 
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Among these 51 capsular type 6B isolates were observed 22 
different PspAs based on PspA serotype and molecular weight 
variations of PspA. 

While most pneumococcal strains appear to have two DNA 
sequences homologous with both the 5' and 3' halves of pspA, 
site-specific truncation mutations of Rxl have revealed that one 
these, pspA, encodes PspA. The other sequence has been 
provisionally- designated as the pspA-like sequence. At present 
whether the pspA-like sequence makes a gene product is unknown - 
Evidence that the pspA and pspA-like genes are homologous but 
distinct groups of alleles comes from Southern blot analysis at 
high stringencies. Additional evidence that pspA and the pspA- 
like loci are distinct comes from studies using PGR primers that 
permit amplification of a single product approximately 2Kb in 
size from 70% of pneumococci. For the remaining 30% of 
pneumococci no amplification was observed with the primers used. 
Evidence for two PspAs ; 

When the strains of MC25-28 were examined with the 
panel of seven MAbs specific for different PspA epitopes, all 
four demonstrated the same patterns of reactivity (Fig. 4). The 
MAbs XiR278 and 2A4 detected a PspA molecule with an apparent 
molecular weight of 190 KDa in each isolate. In accordance with 
the previous PspA serotyping system, the 190 KDa molecule was 
designated as PspA type 6 because of its reactivity with XiR278 
and 2A4 , but none of the five other MAbs in the typing system. 
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Each isolate also produced a second PspA molecule with an 
apparent molecular weight 82 KDa. The 82 KDs PspA in each 
isolate was detected only with the MAb 7D2 and was designated as 
type 34. No reactivity was detected with MAbs Xil26, Xi64, 1A4, 
or SR4W4. The fact that all four capsular 6B strains exhibit two 
PspAs, based on both molecular weights and PspA serotypes, 
suggested that they might be members of the same clone. 
Simult aneous production of both PspAs ; 

Results from the colony immunobloting showed that both 
PspAs were present simultaneously in each colony of these 
isolates when grown in vitro. All colonies on each plate of the 
original culture, as well as all of the progeny colonies from a 
single colony, reacted with MAbs XiR278, 2A4, and 7D2. 
Mii^Hai- of DspA oenes i 

One explanation for the second PspA molecule was that 
these strains contained an extra pspA gene, since most strains 
contain a pspA gene and a pspA-like gene it was expected that if 
an extra gene were present one might observe at least three pspA 
homologous loci in isolates MC25-28. In Hind III digests of 
MC25-28 each strain revealed a 7.7 and 3.6 Kb band when probed 
with plSMpspA13/2 (Figure 5A) . In comparison, when Rxl DMA was 
digested with Hind III and hybridized with plSMpspAl3.2, 
homologous seguences were detected on 9.1 and 4.2 Kb fragments as 
expected from previous studies (9) (Figure 5A) . Results 
consistent with only two pspA-homologous genes in MC25-28 were 
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also obtained with digestion using^four additional enzymes (Table 
15). 

In previous studies it has been reported that probes 
for the 5' half of pspA (encoding the alpha-helical half of the 
protein) bind the pspA-like sequence of most strains only at a 
stringency of around 90%. With chromosomal digests of MC25-28 we 
observed that the 5' Rxl probe of pLSMpspA12/6 bound both pspA 
homologous bands at a stringency of greater than 95 percent. The 
same probe bound only the pspA containing fragment Rxl at a 
stringency above 95 percent (Figure 5B) . 

Further characterization of the pspA gene was done by 
RFLP analysis of PGR amplified pspA from each strain. Since 
previous studies indicated that individual strains yielded only 
one product, and since the amplification is carried out with 
primers based on a known pspA sequence, it seems likely that in 
each case the amplified products represent the pspA rather than 
the pspA-like gene. When MC25-28 were subjected to this 
procedure, an amplified pspA product of 2.1 Kb was produced in 
each case. When digested with Hha 1 digest the sum of the 
fragments obtained with each enzyme was approximately equal to 
the size of the 2.1 Kb amplified product (Figure 6). These 
results suggest that the 2.1 Kb amplified DNA represents the 
cunplified product of only a single DNA sequence. Rxl, by 
comparison, produced an amplified product of 2.0 Kb and five 
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fragments of 0.76, 0.468, 0390, 0.349 and 0.120, when digested 
with Uha 1 as expected from its known pspA sequence. 

The four isolates examined in this Example are the 
first in which two PspAs have unambiguously been observed. The 
interpretation that two PspAs are simultaneously expressed by a 
single pneumococcal isolate is based on the observation that 
bands of different molecular weights were detected by different 
MAbs t6^ PspA. ""Isolates' used in this study were from a group 
originally selected for study by Brian Spratt because of their 
resistance to penicillin. It is very likely that all four of the 
isolates making two PspAs are related since they share PspA 
serotypes, amplified pspA RFLPs, chromosomal pspA RFLPs, capsule 
type, and resistance to penicillin. 

The interpretation of . studies presented here, showing 
the existence of two PspAs in the four strains MC25-28, must be 
set in the context of what is know about the serology PspA as 
detected by Western blots. PspAs of different strains have been 
shown previously to exhibit apparent molecular weight sizes 
ranging from 60 to 200 KDa as detected by Western blots. At 
least part of this difference in size is attributable to 
secondary structure. Even for the PspA of a single isolate, band 
of several sizes are generally observed. Mutation and 
immunochemistry studies have demonstrated, however, that all of 
the different sized PspA band from Rxl are made by a single gene 
capable of encoding a 69 KDa protein. The heterogeneity of band 
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size on Western blots of PspA made by a single strain appears to 
be due to both degradation and polymerization. 

PspA was originally defined by reciprocal absorption 
studies demonstrating that a panel of MAbs to Rxl surface 
proteins each reacted with some protein and later by studies 
using Rxl and WU2 derivatives expressing various truncated forms 
of PspA. In both cases it was clear that each MAbs to the PspA 
of a giverf stra iff "reacted "with the same protein. Such detailed 
studies have not been done with each of the several hundred human 
isolates. It is possible that with some isolates, reactivity of 
the MAbs with two PspAs may have gone unnoticed. This could have 
happened if all reactive antibodies detected both PspAs of the 
same isolate, or if the most prominent migration bands from each 
of the two PspAs co-migrated. With isolates MC25-28 the 
observation of two PspAs was possible because clearly 
distinguishable bands of different molecular weights reacted 
preferentially with different MAbs. 

Applicants favor the interpretation that isolates MC25- 
28 each make two PspAs, because an alternative possibility, 
namely, that the 190 KDa PspA detected by MAbs XiR278 and 2A4 
might be a dimer of the 84 KDa monomer detected by MAb 7D2, if 
the epitopes recognized by the different MAbs were dependent on 
either the dimer ic or monomer ic status of the protein, seems 
unlikely since whenever MAbs react with the PspA of a strain, 
they usually detect both the monomeric and the dimer ic forms. No 



wo 97/09994 



PCT/US96/I4819 



Other isolates have been observed where some MAbs detected only 
the apparent dimeric form of PspA while others detected only the 
monomer ic form. 

There could be several possible explanations for the 
failure to observe two PspAs produced by most strains, i) All 
pneumococci might make two pspAs in culture, but MAbs generally 
recognize only one of them (perhaps in this isolate there has 
been a recombination .between pspa DNA and the pspA-like locus, 
thus allowing that locus to make a product detected by MAb to 
PspA). 2) All pneumococci can have two pspAs but the expression 
of one of them generally does not occur under in vitro growth 
conditions. 3) The pspA-like locus is normally a nonfunctional 
pseudogene sequence that for an unexplained reason has become 
functional in these isolates. 

It seems unlikely that the expression of only a single 
PspA by most strains is the result of a phase shift that permits 
the expression of only the pspA or pspA-like gene at any one 
time, since many of the strains examined repeatedly and 
consistently produce the same PspA. In the case of strains MC25- 
28, the appearance of two PspAs is apparently not the result of a 
phase switch, since individual colonies produced both the type 6 
and the type 34 PspAs. 

Presumably in these four strains, the second PspA 
protein is produced by the pspA-like DNA sequence. At high 
stringency, the probe comprising the coding region of the alpha- 
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helical half of PspA recognized both pspA homologous sequences of 
MC25-28 but not the pspA-like sequence of Rxl. This finding 
indicates that the pspA-like sequence of MC25-28 is more similar 
to the Rxl pspA sequence than is the Rxl pspA-like sequence. If 
the pspA-like sequence of these strains is more similar to pspA 
than most pspA- like sequences, it could explain why we were able 
to see the products of pspA-like genes of these strains with our 
MAbs.- The finding of two families of PspAs made in vivo by 
pneumococci, allows for use of the second PspA in compositions, 
as well as the use of DNA primers or probes for the second gene 
for more conclusive detecting, determining or isolating of 
pnewnococci. 

Isolates and Bacterial Cell Culture ; 

Pneumococcal isolates described in these studies were 
cultured from patients in Barcelona, Spain (one adult at 
Bellvitge Hospital, and three children at San Juan de Dios) 
between 1986 and 1988 (Table 2) . These penicillin resistant 
pneumococci originally in the collection of Dr. Brian Spratt were 
shared with applicants by Dr. Alexander Tomasz at the Rockefeller 
Institute. Rxl is a rough pneumococcus used in previous studies, 
and it is the first isolate in which pspA was sequenced. 
Bacteria were grown in Todd-Hewitt broth with 0.5% yeast extract 
or on blood agar plates overnight in a candle jar. Capsular 
serotype was confirmed by cell agglutination using Danish 
antisera (Statens Seruminstitut, Copenhagen, Denmark) as 
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previously described. The isolates were subsequently typed as 6B 
by Quellung reaction, utilizing rabbit antisera against 6A or 6B 
capsule antigen prepared by Dr. Barry Gray. 
Bacterial Ivsates: 

Cell lysates were prepared by incubating the bacterial 
cell pellet with 0.1% sodium deoxycholate, 0.01% sodium 
dedecylsulfate (SDS) , and 0.15 M sodium citrate, and then 
diluting the lysate in 0.5M Tris hydrochloride (pH 6.8) as 
previously described. Total pneumococcal protein in the lysates 
was quantitated by the bicinchonic acid method (BCA Protein Assay 
Reagent; Pierce Chemical Company, Rockford, XL) . 
PspA serotvpina : 

Serotyping of PspA was performed according to 
previously published methods. Briefly, pneumococcal cell lysates 
were subjected to SDS-PAGE, transferred to nitrocellulose 
membranes, and developed as Western blots using a panel of seven 
MAbs to PspA. PspA serotypes were assigned based on the 
particular combination of MAbs with which each PspA was reactive. 
Colony Immunoblottina : 

A ten ml tube of Todd-Hewitt broth with 0.5% yeast 
extract was inoculated with overnight growth of MC23 from a blood 
agar plate. The isolate was allowed to grow to a concentration 
of lo"^ cells/ml as determined by an O.D. of 0.07 at 590nm. MC23 
was serially diluted and spread-plated on blood agar plates to 
give approximately 100 cells per plate. The plates were allowed 
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to grow overnight in a candle jar, -and a single block agar plate 
with well-defined colonies was selected. Four nitrocellulose 
membranes were consecutively placed on the plate. Each membrane 
was lightly weighted and left in place for 5 minutes • In order 
to investigate the possibility of phase-variation between the two 
proteins detected on Western blots a single colony was picked 
from the plate, resuspended in ringers, and spread-plated onto a 
blood agar plate. The- membranes were developed as Western blots 
according to PspA serotyping methods. 
Chromosomal DNA Preparation : 

Pneumococcal chromosomal DNA was prepared as in Example 9. The 
cells were harvested, washed, lysed, and digested with 0.5% 
(wt/vol) SDS and lOQfxg/ml proteinase K at 37°C for 1 hour* The 
cell wall debris, proteins, and polysccharides were complexed 
with 1% hexadecyl trimethyl ammonium bromide (CTAB) and 0.7M 
sodium chloride at 65**C for 20 minutes, then extracted with 
chloroform/ isoamyl alcohol. DNA was precipitated with 0.6 
volumes isopropanol, washed, and resuspended in lOmM Tris-HCL, 
ImM EDTA, pH 8.0. DNA concentration was determined by 
spectrophotometric analysis at 260nm. 
Probe preparation : 

5' and 3' oligonucleotide primers homologous with 
nucleotides 1 to 26 and 1967 to 1990 of Rxl pspA (LSM 13 and 
LSM2, respectively) were used to amplify the full length pspA and 
construct probe LSMpspA13/2 from Rxl genomic DNA. 5' and 3' 
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Oligonucleotide primers homologous to nucleotides 161 to 187 and 
nucleotides 1093 to 1117 (LSM 12 and LSM 6, respectively) were 
used to amplify the variable alpha-helical region to construct 
probe LSMpspA12/6. PGR generated DNA was purified by Gene Clean 
(BiolOl Inc., Vista, CA) and random prime-labeled with 
digoxigenin-ll-dUTP using the Genius i Nonradioactive DNA 
Labeling and Detection Kit as described by the manufacturer 
(Boehringer Mannheim, Indianapolis, IN) . 
DNA electrophoresis : 

For Southern blot analysis, approximately lO/xg of 
chromosomal DNA was digested to completion with a single 
restriction endonuclease, (Hind III, Kpn 1, EcoR l, Dra 1, or Pst 
1) then electrophoresed on a 0.7% agarose gel for 16-18 hours at 
35 volts. For PGR analysis, 5ul of product were incubated with a 
single restriction endonuclease, (Bel i, BamH 1, Pst l. Sac 1, 
£coR 1 Sma 1, and Kpn 1) then electrophoresed on a 1.3% agrose 
gel for 2-3 hours at 90 volts. In both case, l Kb DNA ladder was 
used for molecular weight makers (BRL, Gaithersburg, MD) and gels 
were stained with ethidium bromide for 10 minutes and 
photographed with a ruler. 
Southern blot hYbridizatton 

The DNA in the gel was depurinated in 0.25N HCL for 10 
minutes, denatured in 0.5M NaOH and 1.5M NsCl for 30 minutes, and 
neutralized in 0.5M Tric-HCl (pH 7.2), i.SM NaCl and ImM disodium 
EDTA for 30 minutes. DNA was transferred to a nylon membrane 
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(Micron Separations INC, MA) using a POSIBLOT pressure blotter 
(Strategene, La Jolla, CA ) for 45 minutes and fixed by UV 
irradiation. The membranes were prehybridized for 3 hours at 
42-C in 50% formamide, 5X SSC, 5X Denhardt solution, 25mM sodium 
phosphate (pH 6.5), 0.5% SDS 3% (wt/vol) dextran sulfate and 
500/xg/ml of denatured salmon containing 45% formamide, 5X SSC, IX 
Denhardt solution, 20inM sodium phosphate (pH 6.5), 0.5% SDS, 3% 
dextran sulfate, 25 OAxg/ml denatured^ sheared salmon sperm DNA and 
about 20ng of heat-denatured diogoxigenin-labeled probe DNA. 
After hybridization, the membranes were washed twice in 0.1% SDS 
and 2X SSC for 3 minutes at room temperature. The membranes were 
washed twice to a final stringency of 0.1% SDS in 0.3X SSC at 
es-C for 15 minutes. This procedure yields a stringency greater 
than 95 percent. The membranes were developed using the Genius 1 
Nonradioactive DNA Labeling and Detection Kit as described by the 
manufacturer (Boehringer Mannheim, Indianapolis, IN) . To perform 
additional hybridization with other probes, the membranes were 
stripped in 0.2N NaOH/0.l%SDS at 4 0''C for 30 minutes and then 
washed twice in 2X SSC. 
Polymer ase Chain Reaction (PCR> ; 

5' and 3' primers homologous with the DNA encoding the 
N- and C-terminal ends of PspA (LSM13 and LSM2, respectively) 
were used in these experiments. Amplifications were made using 
Tag DNA polymerase, MgCl^ and lOX reaction buffer obtained from 
Promega (Madison, WI) . DNA used for PGR was prepared using the 
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method previously described in this paper. Reactions were 
conducted in 50.1 volumes containing 0.2^ of each dNTP, and 1.1 
Of each primer at a working concentration of somM. Mgci^ was 
used at an optimal concentration of- i.75«« with 0.25 units of Tag 
DNA polymerase. Ten to thirty ng of genomic DNA was added to 
each reaction tube. The amplification reactions were performed 
in a thermal cycler (M.J. Research, inc., using the following 
three step program, step 1 consisted of a denaturing temperature 
Of 94 -C for 2 minutes, step 2 consisted of 9 complete cycles of 
a denaturing temperature of 94 for 1 minute, an annealing 
temperature of 500c for 2 minutes, and an extension temperature 
Of 72 oc for 3 minutes. step 3 cycled for 19 times with a 
denaturing temperature of 940c for 1 minute, an annealing 
temperature of eo^C for 2 minutes, and an extension temperature 
Of 720C for 3 minutes. At the end of the last cycle, the samples 
were held at 72 oc for 5 minutes to ensure complete extension. 
Band aizB estjf^fl»4^».. 

Fragment sizes in the molecular weight standard and in 
the southern blot hybridization patterns were calculated from 
migration distances. The standard molecular sizes were fitted to 
a logarithmic regression model using Cricket Graph (Cricket 
software, Malvern, PA). The molecular weights of the detected 
bands were estimated by entering the logarithmic line equation 
obtained by Cricket Graph into Microsoft Excel (Microsoft 
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Corporation, Redmond, WA) in order to calculate molecular weight; 
based in migration distances observed in the Southern blot. 
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Table iS. 



Restriction 
Enzyme 



Strains Examined 



Hind Hi 
l^n I 
EooRI 
Dra I 
Pst I 



Restriction Fragments 




+ 
+ 



+ 
+ 



+ 
+ 



7.7, 3.6 
11.6. 10.6 
8.4, 7.6 
2.1. 1.1 
>14, 6.1 



9.1. 4.2 
10.6. 9.8 

7.8, 6.6 

1.9. 0.9 
10.0. 4.0 



?5rt 



wo 97/09994 



PCT/US96/14819 



Table iS. Penicillin Resistant Capsular Serogroup 6 Strains from Spain 

isolate Penicillin MIC fiio/ml> Year SItP Hospital 

MC25 1 1986 sputum Bellvitge 

MC26 4 1988 ear San Juan de Dlos 

■^027 1 1988 ear San Juan de Kos 

MC28 2 1988 ? San Juan de Dios 
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EXAMPLE 5 - Southern blot ana lysis of pspAs and Fragments of psp A 

In this example. Applicants used oligonucleotides 
derived from the DNA sequence of pspA of S. pneumoniae Rxl both 
as hybridization probes and as primers in the polymerase chain 
reaction to investigate the genetic variation and conservation of 
the different regions of pspA and pspA-like sequences. The 
probes used ranged in size from 17 to 3 3 bases and included 
sequences representing the minus 35/ the leader, the a-helical 
region, the proline-rich regions, the repeat regions, and the C- 
terminus. Applicants examined 18 different isolates representing 
12 capsular and 9 PspA serotypes. The proline-rich, repeat, and 
leader, regions were highly conserved among pspA and pspA-like 
sequence • 

In the previous Example, it was shown that strain Rxl 
and most other strains of S. pneumoniae had two homologous 
sequences that could hybridize with probes encoding the N 
terminal and C terminal halves of PspA. This conclusion that 
these were separate sequences was supported by the fact that no 
^^'tter which restriction enzymes was used there were always at 
least two (generally two sometimes three or four) restriction 
fragments of Rxl and most other strains hybridized with the pspA 
probes. When the genome of Rxl was digested with ifindlll and 
hybridized with these, two pspA-homologous sequences were found 
to be in 4.0 and 9.1 kb fragments. Using derivative of Rxl which 
had insertion mutations in pspA, it was possible to determine 
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that the 4,0 kb fragment contained the functional pspA sequence. 
The pspA-homologous sequence included within the 9.1 kb band was 
referred to as the pspA-like sequence. Whether or not the pspA- 
like sequences makes a product is not know, and none has been 
Identified in vitro. Since pspA-specif ic mutants can be 
difficult to produce in most strains, and exist for only a 
limited number of pneumococcal isolates, this Example identifies 
oligonucleotide probes that could distinguish between the pspA 
and pspA-like sequences. 

The purpose of this Example was to further define both 
the conserved and variable regions of pspA, and to determine 
whether the central proline-rich region is variable or conserved, 
and identify those domains of pspA that are most highly conserved 
in the pspA-like sequence (and ergo, provide oligonucleotides 
that can distinguish between the two) . Oligonucleotides were 
used and are therefore useful as both hybridization probes and as 
primers for polymerase chain reaction (PGR) analysis. 
Hybridization with oligonucleotide probes . 

The oligonucleotides used in this study were based on 
the previously determined sequence of Rxl PspA. Their position 
and orientation relative to the structural domains of Rxl PspA 
are shown in Figure 7. The reactivity of these oligonucleotide 
probes with the pspA and pspA-1 ike sequences was examined by 
hybridization with a Hindlll digest of Rxl genomic DNA (Table 
17) . As expected, each of the eight probes recognized the pspA- 
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containing 4.0 kb fragment of the Hindlll digested Rxl DNA. Five 
of the 8 probes (LSMl, 2, 3, 7, and 12) could also recognize the 
pspA-like sequence of the 9.1 kb band at least at low stringency. 
At high stringency four of the probes (LSM2, 3, 4 and 5) were 
specific for the 4.0 kb. 

These 8 probes were used to screen Hindlll digest of 
the DAN from 18 strains of S. pneumoniae at low and high 
stringency. For comparison" to earlier studies each of the 
strains was also screened using a full-length pspA probe. Table 
23 illustrates the iresults obtained with each strain at high 
stringency. Table 18 summarizes the reactivities of the probes 
with the strains at high and low stringency. Strain Rxl is a 
laboratory derivative of the clinical isolate, D39. The results 
obtained with both strains were identical. They are listed under 
a single heading in Table 23 and are counted as a single strain 
in Table 28. Although AC17 and AC94 are related clinical 
isolates, they have distinguishable pspAs and are listed 
separately. All of the other strains represent independent 
isolates. 

The only strain not giving at least two pspA-homologous 
Hindlll fragments was WU2. This observation was expected since 
WU2 was previously shown to have only one pspA-homologous 
sequence and to give only a single Hindlll fragment that 
hybridizes with Rxl pspA. Even at high stringency 6 of the 8 
probes detected more than one fragment in at least one of the 18 
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strains Tables 18 and 23. Probes LSM7, 10. and 12 reacted with 
DNA from a majority of the strains and detected two fragments in 
over 59% of the strains they reacted with. Tn almost every case 
the fragments detected by the oligonucleotide probes were 
identical in size to those detected by the full-length pspA 
probe. Moreover, the same pairs of fragments were frequently 
detected by probes from the 3' as well as the 5' half Rxl pspA. 
These results are consistent^ with earlier findings that the pairs 
of Hxndlll fragments from individual isolated generally include 
two separate but homologous sequences, rather than fragments of a 
single pspA gene. 

The differences in the frequency with which the 
oligonucleotides reacted with (at least one fragment) of the 
strains in the panel was significant at P < 0.0001 by 2 x 8 chi 
square) . When the oligonucleotides were compared in terms of 
their ability to react with both fragments of each strain the P 
value was also < 0.0001. Table 18 gives the percentage of 
strains reactive with each probe, the percentage in which only 
one fragment was reactive, and the percentage in which two (or 
more) fragments were reactive. 

The last column in Table 18 give the ratio of strains 
that showed one reactive Hindlll fragment at high stringency 
divided by the total number of reactive strains. In this column 
values of 1 were obtained with probes that only reacted with one 
band in each reactive strain. Such probes are assumed to be 
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those that are roost specific for pspA. The lowest values were 
obtained with probes that generally see two bands in each strain. 
Such probes are assumed to be those that represent regions 
relatively conserved between the pspA and pspA-like sequences. 
At high stringency, probes LSM3 and LSM4 detected only a single 
Hljidlll fragment in the DAN of strains they reacted with. These 
findings suggested probes LSM3 and LSM4 were generally detecting 
alleles of pspA rather ^IS," -the pspA- like sequence. The 
observation that the fragments detected by LSM3 or LSM4 were also 
detected by all of the other reactive probes, strengthened the 
conclusion that these probes generally detected the pspA rather 
than the pspA-like sequence. WU2 has only one pspA-homologous 
DNA sequence and secretes a serologically detectable PspA. The 
fact that LSM3 reacts with the single HindlU fragment of WU2 is 
consistent with the interpretation that LSM3 detects the pspA 
sequences. Sequences representing the second proline region 
(LSMl) and the C-terminus (LSM2) appeared to also be relatively 
specific for the pspA sequences since they were generally 
detected in only one of the Jfindlll fragments of each strain. 

Oligonucleotides, LSM12, and LSMIO detected the most 
conserved epitopes of pspA and generally reacted with both pspA- 
homologous fragments of each strain (Table 18). LSM7 was not 
quite as broadly cross-reactive but detected two PspAs in 41% of 
strains including almost 60% of the strains it reacted with. 
Thus, sequences representing the leader, first proline region, 
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and the repeat region appear to be relatively conserved not only 
within pspA but between the pspA and pspA-like sequences. LSM3, 
4, and 5 reacted with the DNA from the smallest fraction of 
strains of any oligonucleotide (29 - 35 percent) , suggesting that 
the portion of pspA encoding the a-helical region is the least 
conserved region of pspA. 

With two strains BG85C and LB 1905, the oligonucleotides 
detected more than two f/indlll fragme'nts containing pspA- 
homologous sequences. Because of the small size of the 
oligonucleotide probes and the absence of Hindlll restriction 
sites within any of them, it is very unlikely that these multiple 
fragments were the results of fragmentation of the target DNA 
within the probed regions. In almost every case the extra 
oligonucleotides were detected at high stringency by more than 
one oligonucleotide. These data strongly suggest that at least 
in these two strains there are 3 or 4 sequences homologous to at 
least portions of the pspA. The probes most reactive with these 
additional sequences are those for the leader, the a-helical 
region and the proline rich region. The evidence for the 
existence of these additional pspA-related sequences was 
strengthened by results with BG58C and L81905 at low stringency 
where the LSM3 (a-helical) primer picked up the extra 1.2 kb band 
of L81905 (in addition to the 3.6 kb band) and the LSM7 (proline- 
rich) primer picked up the extra 3.2 and 1.4 kb bands (in 
addition to the 3.6 kb band) of BG58C. 
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Amplification of pspA 

The utility of these oligonucleotides as PGR primers 
was examined by determining if they could amplify fragments of 
pspA from the genomic DNA of different pneumococcal isolates. 
Applicants attempted to amplify pspAs from 14 diverse strains of 
5, pneumoniae comprising 12 different capsular types using 
primers based on the Rxl pspA sequence. Applicants observed that 
the 3/ primer LSM2r which- is located at the 3' end of pspA, would 
amplify an apparent pspA sequence from each of the 14 
pneumococcal strains when used in combination with LSMl located 
in the region of pspA encoding the proline-rich region (Table 
19). LSM2 was also used in combination with four other 5' 
primers LSMl, 3, 7, 8 and 12. LSM8 is located 5' of the pspA 
start site (near the -35 region) . 

If a predominant sequence of the expected length was 
amplified that could be detected on a Southern blot with a full- 
length pspA probe, we assumed that pspA gene of the amplified DNA 
had homologous sequences similar to those of the pspA primers 
used. Based on these criteria the primer representing the a- 
helical sequence was found to be less conserved than the primers 
representing the leader, proline, and C- terminal sequences. 
These results were consistent with those observed for 
hybridization. The lowest frequency of amplification was 
observed with LSM8 which is from the Rxl sequence 5' of the pspA 
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start site. This oligonucleotide was not used in the 
hybridization studies • 

Further evidence for variability comes from differences 
in the sizes of the amplified pspA gene. The Example showed that 
when PGR primers LSM12 and LSM2 were used to amplify the entire 
coding region of PspA, PGR products from different pneumococcal 
isolates ranged in size from 1.9 and 2.3 kb (Table 20). The 
regions -within pspA encoding the a-helical," prolinerrich, and 
repeats were also amplified from the same isolates. As seen in 
Table 20, the variation in size of pspA appeared to come largely 
from variation in the size of pspA encoding encodes the a-helical 
region. 

Using probes that consisted of approximately the 5' and 
3' halves of pspA it has been determined that the portion of pspA 
that encodes the a-helical regions is less conserved than the 
portion of pspA that encodes the C-terminal half of the molecule. 
This Example show using 4 oligonucleotide probes from within each 
half of the DNA encoding PspA. Since a larger number of smaller 
probes were used. Applicants have been able to obtain a higher 
resolution picture of conserved and variable sequences within 
pspA and have also been able to identify regions of likely 
differences and similarities between pspA and the pspA-like 
sequences. 

The only strains in which the pspA gene has been 
identified by molecular mutations are Rxl, D39 and WU2. Rxl and 
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D39 apparently have identical pspA molecules that are the result 
of the common laboratory origin of these two strains. WU2 lacks 
the pspA-like gene. Thus, when most pneumococci are examined by 
Southern blotting using full length-pspA as a probe, it is not 
possible to distinguish between the pspA and pspA-like loco, 
since both are readily detected. A major aim of these studies 
was to attempt to identify conserved and variable regions within 
,the pspA and pspA- li}cg^^^ci. ^Jl, related- aim was to determine 
whether probes based on the Rxl pspA could be identified that 
would permit one to differentiate pspA from the pspA-like 
sequence. Ideally such probes would be based on relatively 
conserved portion of the pspA sequence that was quite different 
in the pspA-like sequence. A useful pspA specific probe would be 
expected to identify the known Rxl and WU2 pspA genes and 
identify only a single Hindlll fragment in most other strains. 
Two probes (LSM3 and LSM4) never reacted with more than one pspA- 
homologous sequence in any particular strain. Both of reacted 
with Rxl pspA and LSM3 reacted with WU2 pspA. Each of these 
probes reacted with 4 of the other 15 strains. When these probes 
identified a band, however, the band was generally also detected 
by all other Rxl probes reactive with that strain's DNA. 
Additional evidence that the LSM3 and LSM4 were restricted to 
reactivity with pspA was that they reacted with the same bands in 
all three non-Rxl strains. Each probe identifies pspA in certain 
strains and even when used in combination they recognized pspA in 
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over 40 percent of strains. Probes for the second proline-rich 
region (LSMl) and the C-terminus of pspA (LSM2) generally, but 
not always, identified only one pspA-homologous sequence at high 
stringency. Collectively LSMl, 2, 3, and 4 reacted with 16 of 
the 17 isolates and in each case revealed a consensus band 
recognized by most to all of the reactive probes. 

By making the assumption that in different strains the 
Rxl pspA probes are more^-ilikely^ to recognize pspA than the pspA- 
like sequences, it is possible to make some predictions about 
areas of conservation and variability within the pspA and pspA- 
like sequences. When a probe detected only a single pspA- 
homologous sequence in an isolate, it was assumed that it was 
pspA. If the probe detected two pspA-homologous sequences, it 
was assumed that it was reacting with both the pspA and pspA-like 
sequence. Thus, the approximate frequency with which a probe 
detects pspA can be read from Table 18 as the percent of strains 
where it detects at least one pspA-homologous band. The 
approximate frequency with which the probes detect the pspA-like 
sequence is the percent of strains in which two or more pspA- 
homologous band are detected. 

Using these assumptions the most variable portion of 
portion of the pspA gene was observed to be the -35 region and 
the portion encoding a-helical region. The most' conserved 
portion of pspA was found to be the repeat region, the leader and 
the proline rich region. Although only one probe from the region 
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was used, the high degree of conservation among the lo repeats in 
the Rxl sequence makes it likely that other probes for the repeat 
regions give similar results. 

The portion of the pspA- like sequence most similar to 
Rxl pspA was that encoding the leader sequence, the 5' portion of 
the proline rich region, and the repeat region, and those 
portions encoding the N-terminal end of the proline-rich and 
repeat regions. The- repeat -region of PspA has been shown to be 
involved in the attachment to PspA to the pneumococcal surface. 
The conservation of the repeat region among both pspA and pspA- 
like genes suggests that if is PspA-like protein is produced, 
that it may have a surface attachment mechanism similar to that 
of PspA. The need for a functional attachment site may explain 
the conservation of the repeat region. Moreover, the 
conservation in DNA encoding the repeat regions of the pspA and 
pspA-like genes suggests that the repeat regions may serve as a 
potential anti-pneumococcal drug target. The conservation in the 
leader sequence between pspA and the pspA- like sequence was also 
not surprising since similar conservation has been reported for 
the leader sequence of other gram positive proteins, such as M 
protein of group A streptococci, it is noteworthily, however, 
that there is little evidence at the DNA level that the PspA lead 
is shared by many genes other than PspA and the possible gene 
product of the pspA-like locus. 
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Although the region encoding the C-terminus of pspA 
(LSM12) or the 3' portion of the proline-rich sequence (LSMl) 
appear to be highly conserved within pspA genes, corresponding 
regions in the pspA-like sequences are either lacking, or very, 
distinct from those in pspA. The reason for conservation at 
these sites is not apparent. In the case of the PspA, its C- 
terminus does not appear to be necessary for attachment, since 
mutants lacking the C^erminaW4,9 amino acids are apparently as 
tightly attached to the cell surface as those with the complete 
sequence. Whether these difference from pspA portends a subtle 
difference in the mechanism of attachment of proteins produced by 
these two sequences in unknown. If the C-terminal end of the 
pspA-like sequence, or the 3' portion of the proline-rich 
sequence in the pspA-like sequence are as conserved within the 
pspA-like family of genes as it is within pspA, then this region 
of pspA and the pspA-like sequence serve as targets for the 
development of probes to distinguish between all pspA and pspA- 
like genes. 

With two strains, some of the oligonucleotide probes 
identified more than two pspA-homologous sequences. In the case 
of each of these strains, there was a predominant sequence 
recognized by almost all of the probes, and two or three 
additional sequences that were each recognized by at least two of 
the probes. One interpretation of the data is that there may be 
more than two pspA-homologous genes in some strains. The 
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significance of such sequences is far from established. It is of 

interest however, that although the additional sequences is far 

from established. It is of interest however, that although the 

additional sequences share areas of homology with the leader, o- 

helical, and proline region, they exhibited no homology with the 

repeat region of the C-terminus of pspA. These sequences, thus, 

night serve as elements that can recombine with pspA and/ or the 

pspA-like sequences^to generate sequence diversity. 

Alternatively the sequences might produce molecules with very 

different C- terminal regions, and might not be surface attached. 

If these pspA- like sequences make products, however, they, like 

PspA, may be valuable as a component of a pneumococcal antigenic, 

immiinological vaccine compositions. 

Bacter ial strains > arovth conditions and isolation 
of ehromoaomal DNA . 

S. pneumoniae strains used in this study are listed in 
Table 5. Strains were grown in 100 ml of Todd-Hewitt broth with 
0.5% yeast extract at 37<*C to an approximate density of 5x10® 
cells/ml. Following harvesting of the cells by centrif ugation 
(2900xg, 10 minutes) , the DNA was isolated as previously 
described and stored at 4*>C in TE (lOmM Tris, ImM EDTA, pH 8.0). 
Amplif ication of pspA sequences > 

Polymerase chain reaction (PGR) primers, which were 
also used as oligonucleotide probes in Southern hybridizations, 
were designed based on the sequence of pspA from pneumococcal 
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strain Rxl- These oligonucleotides were obtained from Oligos 
Etc. (Wilsonville, OR) and are listed in Table 22. 

PCRs were done with a MJ Research, Inc., Programmable 
Thermal Cycler (Watertown, MA) as previously described using 
approximately 10 ng of genomic pneumococcal DAN with appropriate 
5' and 3' primer pair. The sample was brought to a total volume 
of 50 ^1 containing a final concentration of 50mM KCl, lOmM Tris- 
HCr (PH 8.3), T;5 mM-MgCl2, (nolDl% gelatin, 0.5 inM each primer, 
200mM of each deoxynucleotide triphosphate, and 2.5 U of Tag DNA 
polymerase. Following overlaying of the samples with 50 ^1 of 
mineral oil, the samples were denatured at 94*^0 for 2 minutes. 
Then the samples were subjected to 10 cycles consisting of 1 
minute at 94*C, 2 minutes at 50«C, and 3 minutes at 72**C followed 
by another 20 cycles of 1 minute at 94**C, 2 minutes at 50 *C, arid 
3 minutes at 12^C followed by another 20 cycles of 1 minute at 
94«*C, 2 minutes at 60«C, and 3 minutes at 72 *C. After all 30 
cycles, the samples were held at 72**C for an additional 5 minutes 
prior to cooling to 4**C. The PGR products were analyzed by 
agarose gel electrophoresis. 
DKA hybridization analysis . 

Approximately S/xg of chromosomal DNA was digested with 
Hindlll according to the manufacturer's instructions (Promega, 
Inc., Madison, WI) . The digested DNA was electrophoresesed at 35 
mV overnight in a 0.8% agarose gels and then vacuum-blotted onto 
Nytran membranes (Schleicher & Schuell, Keene, NH) . 
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Labeling of oligonucleotide with and detection of 
probe-target hybrids were both performed with the Genius System 
according to the manufacturer's instructions (Mannheim, 
Indianapolis, IN). All hybridizations were done for 18 hours at 
42 "C without formamide. By assuming that l% base-pair 
mismatching results in a l»c decrease in Tm designations of 
"high" and "low" stringency were defined by salt concentration 
and temperature^ofpost^bridization washes. Homology between 
probe and target sequences was derived using calculated Tro the 
established method. High stringency is defined as 90% or greater 
homology, and low stringency is 80-85% sequence homology. 
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Table l^Hybridizadon of oligonucleotides with ^ihdin restriction' 
fragments of Rxl DNA, . 



Oligonucleotide 



Region 



Low 



Stringeaicy . 



LSM12 

LSMS 

LSM3 

LSM4 

LSNf7 

LSMl 

LSMIO 

LSM2 



Leader 
a-helix 
a-helix 
a-helix 
Proline 
Proline 
Repeats 
CMeiminus 



NJ>. 
NJ). 
4.0, 9.1 
4.0 

4.0, 9.1 
4.0,9.1 

NJ>. 

4.0, 9.1 



4.0. 9.1 
4.0 
4.0 
4.0 

4.0, 9.1 
4.0, 9.1 
4.0, 9.1 
4.0 



Note. Values indicated=arc tiie-sizes of rratriction fragments expressed as 
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Table 28. Summary of Hybridization at High and Low Stringency of 

8 Oligonucleotides with Hindm Restriction Fragments of 

the 17 Pneumococcal Isolates Listed in Figure ?. 

Oligonucleotide Percent Percent Percent l band/ 

with riband with ^ bands with 1 band riband 



— Low High Low Hi^ Low High Low Hiah 

^ f ^ ^ ^ 

. 29 18 12 040 

S II 0 24 35 036 lioo 

^^^^^^^ 7? 4? 1J If, 

88 S3 4 1 12 47 41 0J3 0.78 

Note, for all values listed aU 17 strains were examined. If no value is . 
listed, then no strains were examined. 
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Table |jL Ampiification of Pneamococcal Isolates using the Indicated S* Prime 



Combination with the 3' Primer . LSM2 at the 3' end of pspA 


S' Primer 


Region 


Nucleotide 


Amplified/ 


Peree« 


Position 


Tested 


Amplilta 


LSM8 


- 35 


47 to 70 


2/14 


14 


LSM12 


leader 


162 to 188 


8/14 


57 


LSM3 


aohelieal 


376 to 598 


3/14 


21 


LSM7 


proline 


1093 to 1117 


12/14 


86 


LSMl 


proline 


1312 to 1331 


14/14 


too 



Note, by 2x5 chi square analysis the diffezset primers aa^lified 
- different freqnenei«^N>f:-pspAs^(P < 0.0001). The tendency for there 
to be more amplification with the 3' most primers was significant at 
P <0.0001. 
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Table' ag Size of a mplified dsdA fragments in kilobases 
Region Primer Pairs tmabapspAs Size Rnge SJ). 

examined ■ 

Follleiaati LSMI2»LSM2 9 i.9-?? OA 0.17 

a-lieUcal LSM124-LSM6 6 0^ o!l7 

Proline LSM74'LSM9 3 0 0 

Repeals,. LSMH-.LSM2-^ 19. 0.6^.65 OJiS OOl 

Note: amplifi ra tin n was attempted with each set of primers on a panel of 19 difierept pspAs. 
Data is shown only for pspAs that could be amplified with the indicated primer pairs. 
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Table 3i Pneumococcal strains 



strain 


Relevant characteristics 


WU2 


Capsular type 3, PspA type 1 


D39 


Capsular type 2, PspA type 25 


R36A 


Nonencapsulated mutant of D39, 




PspA. type 25 


Rxl 


Nonencapsulated vanant of R36A, 




PspA type 25 


DBL5 


Capsular Qrpe 5, PspA type 33 


DBL6A 


Capsular type 6A, PspA type 19 


A66 


Capsular type 3» PspA type 13 


AC94 


Capsular type 9L, PspA type 0 


AC17 


Capsular type 9L, PspA type 0 


AC40 


Capsular type 9L, PspA type 0 


AC 107 


C^sular type 9 V, PspA type 0 


AClOO 


Capsular type 9 V, PspA type 0 


AC140 


Capsular type 9N, PspA type 18 


D109-1B 


Capsular type 23. PspA type 12 


BG9709 


Capsular type 9, PspA type 0 


BG58C 


Capsular type 6A, PspA type ND 


L8190S 


Capsular type 4, PspA type 25 


L82233 


Capsular type 14, PspA type 0 


L82006 


Capsular type 1. PspA type 0 
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Primer • Sequence (5^ to 30 



LSMl CCOOATCXIAGCTCCrGCACCAAAAAC ~ 

Lac GCG0GTC0Aa3aCTrAAACCCATrcACCATTOG 

LSM3 CCOGATCCTOAGCCAGAGCACnTGGCro 

Law COJCSATOCGCrCAAAGAGATrGATaAGTCTG 

1^ CCGGATCCCGTAGCCAOTCAGTCTAAAOCIG 

fS£ SS«SS?^*^^^^°^'n^CTOGACC^ 

ia£ COQOATOIAGCroCAGCnrAQAAACrcCAG 

JSS S£2Si3Fi3T^^^^TATrrACGGAGGAGGC 

LSM9 GrrmTGGTGCAGGAGCTGG 

LSMIO SeTATGeGTACAGGTra 

LSMll CCACCIGTAGCCATAGC 

J^J? CCGGATCCAGCGTGCCTATCITAGGGGCTGGTr 

LSM13 GCAAGCTTATGATATAGAAATITGTAAC 
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EXAMPLE 6 - Restriction Fragment Length Polymorphisms of psnA 
Reveals Grouping 

Pneumococcal surface A (PspA) is a protection eliciting 
protein of Streptococcus pneumoniae. The deduced amino acid 
sequence of PspA predicts three distinct domains; an a helical 
coiled-coil region, followed by two adjacent proline-rich 
regions, and ten 20 amino acid repeats. Almost all PspA 
molecules are cross-reactive with each other in variable degrees. 
However, using a panel of monoclonal antibodies specific for 
individual epitopes, this protein has been shown to exhibit 
considerable variability even within strains of the same capsular 
type. Oligonucleotide primers based on the sequence of pspA from 
S. pneumoniae Rxl were used to amplify the full-length pspA gene 
and the 5' portion of the gene including the a-helical and the 
proline-rich region. PCR-amplif ied product were digested with 
Hha 1 or Sau3A I to visualize restriction fragment length 
polymorphism of pspA. Although strains were collected from 
around the world and represented 21 different capsular types, 
isolates could be grouped into 17 families or subfamilies based 
on their RFLP pattern. The validity of this approach was 
confirmed by demonstrating that pspA of individual strains which 
are known to be clonal ly related were always found within a 
single pspA family. 

Numerous techniques have been employed in 
epidemiological surveillance of pneumococci which include 
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serotyping, ribotypingi pulsed field electrophoresis, multilocus 
enzyme electrophoresis, penicillin-binding protein patterns, and 
DNA fingerprinting • Previous studies have also utilized the 
variability of pneumococcal surface protein A (PspA) to 
differentiate pneumococci. This protein, which can elicit 
protective antipneumococcal antibodies, is a virulence factor 
found on all pneumococcal isolates. Although PspA molecules are 
commonly cross-reactive, they are seldom antigenically identical. 
This surface protein is the most serologically diverse protein 
know on pneumococci; therefore, it is an excellent market to be 
used to follow individual strains. Variations in PspA and the 
DNA surrounding its structural gene have proven useful for 
differentiation of 5. pneumoniae. 

When polyclonal sera are used to identify PspA, cross- 
reaction is observed between virtually all isolates. Conversely, 
when panels of monoclonal antibodies are used to compare PspA of 
independent isolation they are almost always observed to express 
different combinations of PspA epitopes. A typing system based 
on this approach has limitations because it does not easily 
account for differences in monoclonal binding strength to 
different PspA molecules. Moreover, some strains are weakly 
reactive with individual monoclonal antibodies and may not always 
give consistent results. 

A less ambiguous typing system that takes advantage of 
the diversity of PspA was therefore necessary to develop and was 
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used to examine the clonality of strains. This method involves 
examination of the DNA within and adjacent to the pspA locus. 
Southern hybridizations of pneumococcal chrbmosomal DNA digested 
with various endonucleases, such as Hind III, Dra I, or Kpn I, 
and probed with labeled pspA provided a means to study the 
variability of the chromosome surrounding pspA. When genomic DNA 
is probed, the pspA and the pspA-like loci are revealed. In roost 
digests the pspA probe hybridizes to 2-3 fragments and, digests 
of independent isolates were generally dissimilar. 

Like the monoclonal typing system, the Southern 
hybridization procedure permitted the detection of clones of 
pneumococci. However, it did not provide a molecular approach 
for following pspA diversity. Many of the restriction sites 
defining the restriction fragment length polymorphism (RFLP) were 
outside of the pspA gene, and it was difficult to differentiate 
the pspA gene from the pspA-like locus. In an effort to develop 
a system to follow pspA diversity Applicants examined the RFLP of 
PCR-amplif ied pspA, Amplified pspA was digested with Sau3A I and 
Hha 1, restriction enzymes with four base recognition sites. To 
evaluate the utility of this approach pspA from clinical and 
laboratory strains known to be clonally related as well as random 
isolates were examined. 
Bacterial strains 

Derivatives of the S pneumoniae D39-Rxl family were 
kindly provided by Rob Massure and Sanford Lacks (Figure 8). 
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Eight clinical isolates from Spain and four isolates from 
Hungary, a gift from Alexander Tomasz. Seventy-five random 
clinical isolates from Alabams, Sweden, Alaska, and Canada were 
also studied. 
PCR amplifications 

The oligonucleotide primers used in this study are 
listed in Table 24, Chromosomal DNA, which was isolated 
according to procedures described by Dillard et al., was used as 
template for the PCR reactions. Amplification was accomplished 
in a 50 ^1 reaction containing approximately 50 ng template DNA, 
.25U Tag, 50 of each primer, 17 5 /zM MgCl2, and 200 /iM dNTP in 
a reaction buffer containing 10 /xM Tris-HCl, pH 9.0, 50/iM KCl, 
0.1% Triton X-100, 0-01% wt/vol. gelatin. The mixture was 
overlaid with mineral oil, and placed in a DNA thermal cycler, 
The amplification program consisted on an initial denaturation 
step at 94**C, followed by 29 cycles opf 94**C for 1 min, 55<»C for 
2 min, and 72**C for 3 min. The final cycle included an 
incubation at 72**C for 5 min. 

Restriction fragment analysis of PCR-amplif ied product 

Aliguots of the PCR mixtures were digested with Hha I 
or Sau3A I in a final volume of 20^1 according to manufacturer's 
protocols. After digestion the DNA fragments were 
electrophoresed on a 1.3% TBE agarose gel and stained with 
ethidium bromide. Fragment sizes were estimated by comparison to 
a Ikb DNA ladder (Gibco BRL) . 
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Because of the variability of pspA, and the fact that 
the entire pspA sequence is known for only one gene, it has not 
been possible to design primers which amplify pspA from 100% of 
pneumococcal strains. However, oligonucleotide primers, LSM2 and 
LSMl, can amplify an 800 bp region of the C-terminal end in 72 of 
the 72 stains tested. Based on hybridizations at different 
stringencies, this region was found to be relatively conserved in 
pneumococcal strains, and thus would not be expected to be 
optimal for following restriction polymorphisms within the pspA 
molecule. LSM13 and LSM2, primers which amplify the full length 
pspA gene, can amplify pspA from approximately 79% 55/75 of the 
strains tested (Table 25) • 

Stability of amplified RFLP pattern within clonally related 
pneimococci 

To determine the stability of pspA during long passages 
in vitro, we examined the RFLP pattern of the pspA gene of the 
derivatives of the S. pneumoniae D39-Rxl family. Rxl is an 
acapsular derivative of S. pneumoniae D39, the prototypical 
pneumococcal laboratory strain isolated by Avery in 1914. 
Throughout the 1900 's spontaneous and chemical mutations have 
been introduced into D39 by different laboratories (Figure 8). 
During this period unencapsulated strains were maintained in 
vitro, and D39 was passed both in vivo and in vitro passage. All 
the derivatives of D39, including Rxl, R6, RNC, and R36A, 
produced a 1.9 kb fragment upon PGR amplification of full length 
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pspA. All members of the family exhibited the RFLP pattern. 
Digestion with Sau3A I of PCR amplified full length pspA revealed 
a .83, .58, ,36 and a .27 kb fragment in all of the D39-rXl 
derivatives of the family. Digesting the full length pspA with 
Hha I resulted in bands which were .76. .47, .39, .35, and .12 kb 
(Figure 9 or Table 2 6). 

The stability of pspA polymorphism was also 
investigated using pneumococcal isolates which had previously 
been shown to be clonally related by other criteria, including 
capsule type, antibiotic resistance, enzyme electromorph, and 
PspA serotype. Three sets of isolates, all of which were highly 
penicillin resistant, were collected from patients during an 
outbreak in Hungary and two separate outbreaks in Spain. PCR 
amplified full length pspA from the capsular type 19A 
pneumococcal strains from the outbreak in Hungary, DB18, DB19, 
DB20, and DB21, resulted in a band approximately 2.0 kb. After 
digesting full length pspA with Hha I, four fragments were 
visualized., 89, .48, and .28 kb. Digestion with Sau3A I yielded 
five fragments .880, .75, .35, .34, and .lOkb. Capsule type 6B 
pneumococcal strains, DBl, DB2, DB3, and DB4 , were obtained from 
an outbreak in Spain. Full length pspA from these strains were 
approximately 1.9 kb. Digestion of the PCR-amplif ied fragment 
with Hhs I resulted in four fragments which were .83, .43, .33, 
and .28 kb. Sau3A I digestion yield a .88, .75, .34, and .10 kg 
fragments. DB6, DB8, and DB9, which are capsular serotype 23F 
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Strains, were isolated from a second outbreak in Spain. DB6, 
DB8, and DB9 had an amplified pspA product which was 2.0 kb. Hha 
I digested fragments were .90, .52, .34, and .30 kb and Sau3A I 
fragments were .75, .52, .39, .22, .20, and .10 kb in size 
(Figure 10) . DB7 had a 19A capsular serotype and was not 
identical to DB6, DB8, and DB9. In the D39/Rxl family and in 
each of the three outbreak families the size of the fragments 
obtained from the Hha I and the Sau3A I digests totaled 
approximately 2.0 kb which is expected if the amplified product 
represents a single pspA sequence. 

Diversity of rflp pattern of amplified pspA from random 
pneumococcal isolates 

PGR amplification of the pspA gene from 70 random 

clinical pneumococcal isolates yielded full-length pspA ranging 

in size from 1.8 kb to 2.3 kb. RFLP analysis of PCR-derived pspA 

revealed two to six DNA fragments ranging in size from 100 bp to 

1.9 kb depending on the strain. The calculated sum of the 

fragments never exceeded the size of the original amplified 

fragment. Not all pneumococcal strains had a unique pspA, and 

some seemingly unrelated isolates from different geographical 

regions and different capsular types exhibited similar RFLP 

patterns. Isolates were grouped into families based on the 

number of fragments produced by Hha I and Sau3h I digests and the 

relative size of these fragments. 
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Based on the RFLP patterns it was possible to identify 
17 families with four of the families containing pairs of 
subfamilies. Within families all of the restriction fragments 
were essentially the same regardless which restriction enzyme was 
used. The subfamilies represent situations where two families 
share most but not all the restriction fragments. With certain 
strains an FRLP pattern was observed where detectable fragment 
size differed from the pattern of the established family by less 
than 100 bp. Since the differences were considered small 
compared to the differences in the fragment size and the number 
of fragments between families, they were not considered in family 
designation. The RFLP pattern of two isolates from six of the 
families is pictured in Figure ll, Table 27. These families were 
completely independent of the capsular type or the protein type 
as identified by monoclonal antibodies (Table 28 and 29) . 

Previous DNA hybridization studies have demonstrated 
that the pspA gene of different isolates are the most conserved 
in their 3' region of the gene and more variable in the 5' region 
of the gene. Thus, if seemed likely that the differences in the 
pspA families reflected primarily differences in the 5' end of 
the gene. To confirm this theory, the a helical and proline 
region of pspA was examined without the amino acid repeats. 
Nucleotide primers LSM13 and KSH2 were used to amplify this 
fragment which is approximately 1,6 kb. Examination of this 
region of pspA afforded two things. 
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This primer pair permitted amplification of 90% of the 
strains which is greater than the 75% of the strains which can be 
amplified with oligonucleotides which amplify the full length 
gene. Second, it allowed Applicants to examine if the original 
groupings which were based on the full length gene coincide with 
the fingerprint patterns obtained by looking at the 5' half of 
the gene. 

Figure 12 contains the same strains which were examined 
in Figure 11 but the PCR products were amplified with SKH2 and 
IiSM13. The RFLP patterns obtained from digestion of the 
amplified a helical and proline rich region confirms the original 
designated families. However, these primers amplify a smaller 
portion of the psaA and therefore the difference is the families 
is not as dramatic as the RFLP patterns obtained from the RFLP 
pattern of the full length gene. 

The polymerase chain reaction has simplified the 
process of analyzing pspA gene and have provided a means of using 
pspA diversity to examine the epidemiology of S. pneumonia&. 
Because not all strains contained a unique fingerprint of pspA, 
RFLP patterns of pspA cannot be used alone to identify the 
clonality of a strain. These results indicate the RFLP of PCR- 
amplified pspA from pneumococcal strains in conjunction with 
other techniques may be useful for identifying the clonal 
relatedness among pneumococcal isolates, and that this pattern is 
stable over long passages in vitro. 
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These findings suggests that the population of pspA is 
not as diverse as originally believed. PCR-RFLP of pspA may 
perhaps represent a relatively simplistic technique to quickly 
access the variability of the gene within a population. Further, 
these findings enable techniques to diagnose. S. pneumoniaB via 
PGR or hybridization by primers on probes to regions of pspA 
common within groupings. 
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The sequffltce studies divide the known strains into several families based on 
sequence homologies. Sequence data demonst«tes that there havTbeS^^^e 
recoinbmafaonsoccurimginnatureivlthinpip^ Hie net e&rt^tS 
recombmafaon Is t^«t the "fammes''identifi^ by specific sequenc^^^pendin. 

and fte proline nch region are each distinct and differ sHghtly from each otheT M 
addifam there is consilSabie evidence of other diversity ^cludSJLe^tuS 

This result indicates that it is expected that there will be a continuum of overlaoDin ' 
sequences of PspAs, rather than a discrete set of sequences. o^e^iappm 

J^thf!^^ '"^'^''^^ ^^"^ ^ S^^^^^t conservation of sequence in the 3' ha 

^^ot^'^Sl'''''^'^^^^^^'''^^ Because the diversityintffeirS 
^f ttie a-hehcal region is greater, this region is of Htde use in predirting ck^ 

Thuaftheseqle^f 

A^^^L alpha-hehcal i^on and the 3' tip of the coding sequence axe Hkely to 

romnncf^^^i^*^ • the proline-ridi region may not be particularly impotant to • 
composition of a vaccme because this region has not been shown to be able to elidt 

cross-protection even though it is highly conseived. The reason for this is presuma^/y 
because antibodies to epitopes in this region are not surface exposed. ' 

Based on our present sequences of 27 diverse pspAs we have found that there 
are 4 families of the 3* half of the a-hdical region and 2-3 families of the very 5* tip 
the a-helical region. Together these form 6 combinations of tlie 3' and 5' families. 
This approadi ttierefore should permit us to identify a panel of pspAs with 3" and 5 
hdicalsequencesrepresentativeof tiie greatest number of different ;;s;?As. St^p^l^, 
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Table At Relationship of Capsular ^e and RFLP family. 
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Table 'Sil^c Ollgonueleotlcfos used In this study. 



Dttlgnatlen 


Saquanea 5'-3' 


Nuelaetldtt 
position 


L8M2 


GCG C6T CQA CGG CTT 

AAACCCATTCACCAT 

TOG 


1990 to 1067 


L8M1 


CXSQQATCCAGCTCCT 
QCACCAAAAAC 


131210 1331 


LSM13 
SKH2 


GCAAGCTTATQATAT 
AgAAATTTGTAAC 

CCA C5AT ACC QTT TTC 
TTGTTTCCAQCC 


1to26 
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Table 2£ Amplification of pspA from a panel of 72 independent isolates* of S. 
pneumoniae. 



CAPSULE 
TYPE 


NUMBER OF 
EXAMINED 


LSM13 AND 
LSM2 


LSM13 AND 
SKH2 






% OF STRAINS 
AMPUREO 


% OP !7TRAIKi<; 
AMPLIFIED 


1 


3 


100 


100 


2 


1 


100 


100 


3 


8 


50 


87 


4 


6 


67 


100 


5 


1 


100 


100 


6 


7 


29 


86 


6A 


2 


100 


100 


6B 


6 


100 


100 


7 


2 


50 


100 


8 


1 


100 


100 


9V 


3 


100 


100 


9A 


2 


100 


100 


9L 


1 


100 


100 


9N 


3 


100 


100 


10 


1 


100 


100 


11 


2 


50 


100 


12 


2 


0 


100 


13 


1 


100 


100 


14 


4 


0 


75 


15 


2 


50 


50 


19 


5 


100 


100 


22 


3 


33 


100 


23 


1 


100 


100 


33 


1 


0 


100 


35 


1 


0 


100 


nd 


3 


100 


100 



*Our strain collection contains several groups of isolates known to be previously to be clonal and 
collected for that purpose. The data reported in the table includes only one representative isolate 
from such clonal groups. 
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Table Rx1-D39 derivatives 



ISOLATE 


SIZE OF Hha I DIGESTS 






(Kb) 


DIGESTS (Kb) 


D39 


.76, .47, .39, .35, .12 


.83, .58, .36. .27 


Rxl 


.76. .47, .39, .35, .12 


.83. .58, .36. .27 


R800 


76, .47, .39, .35, .12 


.83, .58, .36, .27 


R6 


76. .47. .39, .35. .12 


.83, .58, .36, .27 


R61 


76. .47. .39, .35, .12 


.83, .58. .36. .27 


R6X 


.76, .47, .39, .35, .12 


.83, .58. .36. .27 


R36NC 


.76, .47. .39, .35, .12 


.83. .58, .36, .27 


R36A 


.76. .47. .39. .35, .12 


.83. .58, .36. .27 
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TABLE distrain Information and family designation of Independent isolates. 



STRAIN 


CAPSULE 


PspA 


FAMILY 


SIZE OF Hha 


i SI2£ OF Sau3A 1 




TYPE 


TYPE 




FRAGMENTS 


FRAGMENTS 
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1 OS 3"? 
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1 05 3S OO 
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1-0. .80. .35 
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ND 
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4 


20 


E 
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1.15, .50, .34 
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4 


ND 


E 


1.0,.40. J3 


1.15. .50, .34 


BG7619 


10 


ND 
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.82, .76, .35 


BG7941 


11 


ND 


F 
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.82, .76, .35 


BG7813 


14 


8 


H 


1.05, .70. .36 


.90, .77, .35 


BG7736 


8 


ND 


H 


1.05, .70, .36 


.90, .77. .35 


AC113 


9A 


ND 


1 


1.4, .34, .28 


1.2, .80 


AC9g 


9V 


5 


1 


1.4. .34. ^8 


1.2. .80 



MP 
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Tabled^.Relationshxp of RFLP family and PspA type. 



pspA 
FAMILY 


KELAnONSHlr BETWEEN PSPA TOE AND RFLP FAMILY 

PlspAType ~ ■ 
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EXAMPLE 7 - Ability of PspA ixmnunogens to protect against 
individual challenge strains 

CBA/N or BALB cJ mice were given 1 injection of 0.5 - 
Mg PspA in CFA, followed 2 weeks later by a boost in saline, and 
challenged between 7 and 14 (average 10) days post boost. 
Control mice were administered a similar immunization regimen, 
except that the immunization came from an isogeneic strain unable 
to make PspA, The PspA was either full length, isolated from 
pneumococdi or cloned full length or BClOO PspA, as little 
statistical significance has been seen in immunogenicity between 
full length PspA and BCIOO. The challenge doses ranged from 
about 10^ to 10^ pneunocci in inoculum, but in all cases the 
challenge was at least 100 times LD^q. 

The results are shown in the following Tables 30 to 60^ 
and the conclusions set forth therein. 

From the data, it appears that an antigenic, 
immunological or vaccine composition can contain any two to 
seven, preferably three to five PspA, e.g., PspAs from R36A and 
BG9739, alone, or combined with any or all of PspAs from Wu2, 
Ef5668, and DB15. Note that surprisingly WU2 PspA provided 
better protection against D39 that did R36a/Rxl/D39 , and that 
also surprisingly PspA from Wu2 protected better against BG9739 
than did PspA from BG9739. Combinations containing R36A, BG9739 
and WU2 PspAs were most widely protective; and therefore, a 
preferred composition can contain any three PspA, preferably 
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R36A, BG9739 and WU2 . The data in this Example shows that PspA 
from varying strains is protective, and that it is possible to 
formulate protective compositions using any PspA or any 
combination of the PspAs from the eight different PspAs employed 
in the tests. Similarly, one can select PspaS on the basis of 
the groupings in the previous Example. Note additionally that 
each of PspA from R36A, BG9739, EF5668 and DBL5 are, from the 
data, "good for use in compositions. 

A note about use of medians rather than averag eR. 
Applicants have chosen to express data as median (a non- 
parametric parameter) rather than averages because the times to 
death do not follow a normal distribution. In fact there are 
generally two peaks. One is around day 3 or 6 when most of the 
mice die and the other is at > 21 for mice that live. Thus, it 
becomes nonsensical to average values like 21 or 22 with values 
like 3 or 6. One mouse that lives out of 5 has a tremendous 
effect on such an average but very little effect on the median. 
Thus, the median becomes the most robust estimator of time to 
death of most of the mice. 
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Relative ability of different PspAs to Protect against each challenge strains of S, 

pneumoniae 



(Summary of statistically significant protection) 
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Note: Empty cells indicate that no experiment has been done. Bold means significant at P < 0.05, 
Small font bold W means 0.02 < 0.05. Large font bold means P < 0.02, For this table statistical 
significance refers to delay in time to death except as indicate in the (+) footnote below. When "all 
immune" showed significant protection against death but individual data cells did not, the result for 
"all inunune" is presented under best protection on the assumption that if more mice were done in 
each data cell one or more of them would have exhibited significant protection against death. 
+++ = statisticaUy significant protection against death; ^ 50% protection from death 
++ = statistically significant protection against death; < 50% protection from death 
+± - statistically significant delay in deatih; ^0 protection from death 

+ = statistically significant delay in death; <20 protection firom death, (or significant protection 

against death but not a significant delay in death) 
0++ = IM statistically delay in time to death; but k 50% protection from death 
0+ = Not statistically delay in time to deatii; but >15 day extension in median time to death or ^0% 

protection from death. 
0 = No apparent extension in time to death or protection from death. 
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Relative ebilily of different PspAs to Protect against each ehaUengnSs of sT 

ptumnoniae 

(Expre^ed as Median days Alive post challenge) 
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Ability of different PspAs to Protect Against Each Challenge stxain of 

S. pneutnonuic 

(Expressed as increase in survival time in days) 

(A denotes ^ 50% immune mice aiive) 
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Note: Bold denotes statistxcally significant extensim of life at P < 00)5, SmaU foat dcaetc«lU)KP<0Ji9; large font 
denotes P < (UI2. Median increases in survival listed as 3, 9 or 1 A denote groups where the median d^ 
within a continuum of values. In ttiese cases the vahies ghre are those closest to the calculated median. 
Fractional values such as 3.5, indicate that ^ median b halfway between two numbets* in this case 3 and 4. 



Relative ability of different PspAs to Protect against eadi challenge strains of 

S. pneumoniae 
(expressed % alive at 21 days post challenge) 







Vaccine PspA 


R36A. 
EU1«D39 


iD90a 
WU2 


JS1Q20/ 
BC9739 


EF3296 


EF5668 


L81905 


ISSOlOl 
DBL5 


DBL6A 


All 


All 


Chalknge 


Caps 


PspA 


pspA 


Strain 






famii) 


K 


a 


b 


E 


CD 


b 


n 


D 






D39 


2 


25 


K 


38 


60 






30 








38 


3 


WU2 


3 


1 


a 


100 


100 


100 




100 


100 


100 


100 


100 




A66 


3 


13 


a 


75 


100 


80 




75 


100 


60 


20 


76 • 


5 




3 


18 


M 


100 




SO 












90 


0 




3 


7 


a 


100 
















100 


0 




4 


26 


b 


11 


60 


13 


"zsr 


ff" 


25 






12 


0 




4 


2U 


E 


'ii 




Iff" 








0 


0 


8 


0 




4 


U 


CD 


22 


25 


40 


40 


100 


40 


«0 


0 


41 


9 




4 


23 




10 


0 


31 


40 


0 


0 


14 


0 


14 


0 


DBL5 


5 


33 


a 


10 




14 




0 


0 


29 


0 


4 


0 


EF6796 


6A 


1 


t 


100 
















100 


0 


DBUA 


6A 


19 


D 


67 


25 


33 


0 


60 


25 


0 


80 


35 


4 


BG9163 


6B 


21 


c 


89 








80 








86 


20 


BG7322 


6B 


24 


c 


100 


60 


25 


0 


89 


25 


80 


25 


55 


6 



Bold, denotes statistxcaiiy significanl protection against death at P < 0JJ5. Bold smaU fon^, indicates significant 
protection against death at 0.02 S P < 0^. Bold large font Indicates significant protection against death at P < 0i>2. 
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Relative ability of difft^uit PspAs to Protect against each chall^^e strain of 

S. pneumoniae 

(% protected from death at 21.days post challenge) 





R34A. 

wm 


WU2 
ID9U8 


BG9739 
jSlOZO 


EF3296 


EB666 


LB19a5 


DBLS 
ISSOIOJ 


DBL6A 


All 


Btst 
mult 


Qutknge 


Caps 


PspA 




Strain 


iBEL 


iaje 


famil) 


K 


a 


b 


E 


DO 


b 


a 


D 






D39 


2 


25 


K 


36 


59 






a 








36 


59 


WU2 


3 


1 


a 


100 


100 


100 






A66 




13 


a 


71 


100 


79 




74 


100 


58 


W 1 75 


100 




3 


18 


M 


100 




80 










1 90 


100 




3 


"7" 


a { 






















4 


26 


IT 


n 


60 


U 


23 


0 


2S 


0 


0 


u 


60 




4 




E 


25 


20 


10 








0 


0 


6 


2S 


EF566B 


4 


12 




14 


U 


9$ 


34 


100 


34 


M 


•10 


35 


100 


L81905 


4 


"tT 




10 


0 


31 


40 


0 


0 


14 


0 


14 


40 


bftLS 


i 




n 


10 




14 




0 


0 


29 


0 


4 


29 


JEF6796 


6A 


1 


c 


100 
















100 


100 


DBL6A 


6A 


19 


D 


66 


22 


30 


-4 


58 


22 


•4 


79 


33 


79 




6B 


21 


t 


86 




75 












83 


86 


807322 


rsr 


IT 


C 


100 


57 


22 


0 


88 


22 


79 


22 


52. 


100 



. . . <0jD5. Bold smaU font, ifMiif«»»c>igw^fi^^ 

protection against death at Oitt i ? < Bold large font indicates significant protection against deatti at P < Oitt. 
% pr otected has been coirected for any survivors in the control mice. 

% protected - 100 x C% alive in inunnne - % alive in control)/C100 -% alive in control). Thus, if there were any 
inice aUve in the control animals, the calculated protected* is less than Ote observed aUve" listed in the 
previous Ubie. The only exceptions to this are if 100% of inununized mice Ihred. Negative numbers mean that 
less immunixed mice lived than did control mice. Please note that none of these negative numbers are yHmlfiriTit 
even though we are using a one tailed test 
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y/tm^5 

Recommended Immunogens to Protection against the indicated challenge 
stnuns of S. pneumoniae Based on Protection Score 
Based on median days alive and percent protected 
(numbers refer to pr e fei ence as a vaccine attain with respect to the indicated 

challenge stiaiiu 
la best) 
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Bol4 denotes stotistically significant protection against death at P<0i)5. Where more than one FspA were 
equallyprotectiver Ae same values were ghren to each. Recommendations are based on days to 
death with % protection diyiding ties, espedaUy aixiong those where greato 
to21day5* '*0"iridicates test were coruiucted but compared to the other F$pAs this or^ 
r ec omme nded . 

Conclusions: 

Statistically significant protection against death vtrith >50% protection; 11/14 of the strains s 79% 
SUtistfcally significant protection against death; 13/14 strains = 93% 
Statistically sigrdficant extension of life in 14/14 or 100% of strains. 
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Criterion 



i#l PspAfor 
each challenge 
strain 
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This is not a unique combination. See table belo\v. 



Combinations where aU Challenge Strains have a vaccine strain 1 
with a score of i#2 — 1 
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Pooled Data for Protection against D39 by various PspAs; 
Days aitve for each mouse 
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Pooled Data for Proicctjon against WU2. by various PspAs 
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4x>21 


4x>21 




WAX 
>21 














Sx>Z3 












22.2.7;i 


niw 
Baoo 
£143 


"53" 


UtfA/N 

CBA/N 




4x>21 








•it>10 








U5xX3w4 
1,1,2^,2.1 


£144 
£172 


3-9 
3.98 


CBA/N 
CBA/N 






Sx>21 












ftx>21 


Ss2 
5x3 


AH 






>21 
19:0 

100 


>21 
4:0 

KJOOOl 

100 


>21 
5:0 
KJOOOl 

100 


>21 
8:0 
<.00O1 

100 


>21 
4:0 

KJOOOl 

100 


>21 
8:0 

KJOOOl 

100 


>21 
4:0 
<.0001 

100 


>21 
.4:0 
<.0OOI 

100 


>21 
4:0 
<.000t 

100 


2 
1:64 

2 




FL-R36A " 


Baoo 


jtnos 


JS1020 
fBC9739) 


BC9739 
bdOO 1 


EF5668 J 


L819QS 


OBLS 
bdOQ |i 


153020 
13BL6A) 


control 



wo 97/09994 



PCT/US96/14819 



V7Ui 


days o( death 


meditn days 
of death 


dead 


P value 
based on days 
to death 


P value 
based on 
alive : dead 


Score 


alive 


% 
prot. 


luununc 


6lB>21 


>21 


61:0 


<X001 


<.0001 




100 


100 


All 
contfols 


6x1,33x2.201 
3, 4.14.6^, >21 


5 










i "' 


2 



Pooled Data for Protection against A66> by various PspAs 



Exp. 


CFU 
A66 


Mice 




Days to Death/ ixnmunoRen 


FI^R36Ay 
D3» 


lUl 
BClOO 


jO908 
(WU2) 


CB09739) 


fiC9739 
bclOO 


EF5668 


U1905 
Ft 


U1905 
bclOO 


jSSOlOJFL 
(DBL5) 


DBLS 
b^OO 


JS3020 
(DDUA) 


conlml • 


£169 


Z60 




S]i>21 




Sx >21 




















£152^ 
telflO 


xn 












4«>21 






4j«>21 




4i!>21 




3s X 2. 4,6,, 
>21 


Eim 


-30 


rsm 








rAM >22 










3,4.401022 






£143 


3.0 














4,4Jt 
>10 












2j:j3 


E140 


3.43 


5a7k 




4<>21 




















1,1.1 


£172 


3.94 


ClA/h 














5x>21 












£145 


3.97 




13. 4x >21 






















l.Z2,2.4 


£121 


4.16 


caA/^ 


3x3,2x1 
5x>21 






















U8x2.>21 


Ail 






3x3,a4, 
I3.14x>2] 


4x>21 


Sx>21 


2,84x>21 


4x>21 


4.4X 
>21 


5x>21 


4x>21 


3.4A2X >21 


4x>21 


24,4,5>21 


7xl,22xX 
3x3.4.3x6 
ax>2l 


median; 
A: D 


>21 
14:6 


>21 
4:0 


>21 

5:0 


>2l 

3:2 


>2l 

5:0 


>21 
4:1 


>21 
5:0 


>2l 
4:0 


4 

2:3 


>21 
4:0 


4 
1:4 


2 

2:36 


P values 


<0i0001 

<aoooi 


0i>002 
0.0001 


<0.0001 
<0.0001 


OuOM 
0.0075 


<Oi)001 


00006 
OiK)6 


cOiWOI 
cO.0OOt 


Oj0002 

aoool 


0JX}25 
n.s. 


0J)002 
0.0001 


OOIS 
n.s. 






























Mifu Pools 




JO900 


BG9739 


EF3666 


L81905 


DBU 
3.4,4.4.6k>21 


DBUA 


Contfol 




>21 
18:6 


>21 
5:0 


>21 
8:2 


>21 
4:1 


>21 

9:0 


>21 
6:4 


4 

1:4 


2 

2:36 


P vftlaues 
lank/ard 


<0i)001 


<aoooi 

<0.0001 


<aoooi 


0u0006 
0.0K 


<0.0001 


OiXXM 


ouns 
ns. 




Score 
% alive 


72 
71 


100 
100 


80 
79 


75 
74 


100 
100 


60 
58 


♦t 
20 
16 


5 

0 


A66 challenge 


iU«A/iUl/WC44.1 




BG9739 


Em68 


LS1905 


DBL5 


DBUA 





A66 challence 


days of death 


median 
days alive 


alive : 
dead 


P* days to 
death 


P -aUve: 
dead 


Scoce 


% 
alive 


% 

protected 


Aii immune 


2,2.4x3,7x4.5, 
8.U,S0x>21 


>21 


50:16 


<0iX>0] 


<0.0001 


+++ 


76 


75 


AUcontroU 


7x1,22x2.3x3, 
4.3x6, 2x>21 


i "■ 


' 








.5 


0 



Pooled Data for Protection against EF10197. by various PspAs | 


Exp. 


CFU 

£F 
10197 


Mice 


Days to Death/ immunoKen 


Bxl 
BClOO 


151020 
(BC9739) 


L81905 


JS302D 
(DBUA) 




JS5010L3FL 
rDBL5)0 


GOntfol 


£140 


3.00 


CBA/N 


Sx>21 












2Z2 


VUBOC 


2.70 


CDA/N 


• 












7.2J,2.2 


£129 


334 


CBA/N 




8.4x>23 










2X2.2,9 



lioX 



wo 97/09994 



PCT/US96/14819 



Pool of Pc 


>ols for protection aRaisnt EF10197 


line 


Group 
Description 


DcUy in deatA and/or tarvival 


Survival 


days to death (mcdain) 


P vahjca ttc 


4iiv«:<tcftd 


? values etc 


U 


fUl (EI40) 


5i>2l 


01)17 vs lb 
0LOOU¥t4b 


5:0 


0018 vs lb 
Oi)0e«s4b 


3a 


JS1020 (E129) 


8.ix>23 




4:1 




4« 


atl Iminunt 


8»tit>21 




9:1 




lb 


Rxlcontrota(El40) 






0:3 




2b 


MI DOC 






0:5 




3b 


fSia20cDnt(E129} 


2,2.2^19 




0:5 




4b 








0:8 








Summary of protection aRainst EF10197 








median DOD 




Score* 


IRjd 1 5:0 


100 


100 


>21 


OuOt? 


0J)18 




flsiHi 1 in 


60 


80 


>21 


0X007 


00124 






90 


90 


>21 


<0lO0O1 


0l0002* 




Laucontfols f 0:8 


0 


0 


2 









Pooled Data for Protection against ATCC6303. by various 

PspAs 



Bcp. 


CFU 

ATCC 
6309 


Mice 


Days to Death/ immunogen 


Rat 
BQOO 


iS1020 
(BG9739) 


L81905 


JS302O 
(DBUA) 


EFS866 


J550UL3FL 
(DBL5)0 


COfttfpl 


E140 


230 


CBA/N 


Sa>21 












4.4x5 


E129 


X80 


CBA/N 




ILV. 













Unt 


Group 
Description 


Delay in dcatli and/or tunrival 


Survival 


days to death (mcdain) 


Pvahics«tc 


altv«:dcMl 


P vakKSttc 


la 


Rxl (E140) 


5«>21 021) 


0jO04O 


5:0 




lb 




4,4«5 5 




0:5 





|i'fi'fi,'-i'-,'-viH ■ iii'iH'ii nm !ii';amsrnnii 


medianDOD 




|iUl 1 5:0 


100 


100 


>2t 


ao(H 


04)04 j j 


liUlcontmb | 0:5 


0 


0 


3 







wo 97/09994 



PCT/US96/148I9 



cxp. 


CFU 


Mice 








Days to Death/ immunogen 


0C973S 


R36A 
FL 


BON 
(ftxt) 


JD909 
<WU2) 


151020 
[BG9739] 


bclX 
[8C973 
f> 


EF3296 
FL 


EF566a 
r k 


bclOO 
(LI190S; 


jSSOIOJ 

FL 
(DBL5) 


bclOO 


)S3030 
[DBL6 
A) 


omirol 


£140 


2.76 


CBA/N 




34,10.11 




















2,2,3 


E104 




Xid 








6.6744 










2444.4 




U44. 
3 


24444 


H125 


356 


CBA/N 










5447 






44.13. 
>21 




244.4 




14X44 
6 


1:172 


3.71 








6^ 7»3x 


















3.4447 


H124 




Aid 






















U44. 
9 


24444 


£084 


4.(b 


BALB/c 








4t2.2x 
>14 
















9k2 


H144 


4.09 


Xid 


1 










U47. 
>I0 . 


2444.4 










24444 






























































All 






I 


34.10.11 


>2: 


U2.6.6 
7.8. «. 
2x>21 


5447 


2447 
>2l 


2444.4 




7x2,3.3 
4 




aK2.3. 
9 


2U2, 
7xX3x 
4.3i5. 
3x6,7 








3 


3. 10 


>21 


6 


5 


3 


3 


5.U 


2 


2 


2 


. 2 


a:d 






1:4 


0:4 


3:2 


2:9 


0:4 


1:4 


0:5 


1:3 


0:10 


0:4 


0:10 


0:38 































































Pboled Data for Protection against BC9739. by bclOOs and FL Psp As 



Exp. 


CFU 


Mice 




Days to Deatli/ immunogen 




BC973S 




R36A 
FL 


BClOO 
IRxl) 


ID908 
(WU2) 


)SI020 
[BG9739; 


bcIOO 

tBovra 
») 


EF3296 
FL 


EF5668 
FL 


bclOO 
(LSIMS) 


|SSaUL3 

FL 
(DBL5> 


bclOO 
(D6L5] 


ISSOZO 
[DBU 
A) 




£140 


Z76 


CBA/N 




34.10.11 




















2.24 


£104 




Xid 








64744 










2444.4 




U44. 
3 


U444 


£125 


3^6 


CBA/N 










5447 






44.U. 
>21 




244.4 




14.4.44 
6 


£172 


371 


CBA/N 






6.7,3x 
>21 


















3.4447 


£124 




Xid 


















24444 




U44. 
9 


Z4444 


£084 


4J0S 


BALB/C 








4Jt2.1x 
>14 
















9x2 


£144 


4.09 


Xid 


244.6. 
>21 










U47. 
>10 


2444.4 












FL-»bclXBG9739 


R36A/RX1/D39 


WU2 


BC9739 


EF3296 


EFSM8 


U190S 


DB 


LS 


OBUA 


Cont 


AU 


2.te3.6,ian.>2] 


6.7.3X 
>21 


4x2.3x312x6^ 
2x7.2xil2x>21 


1447. 
>21 


24x3,4 


44.13. 
>21 ' 


10x2.3.3.4.4 


6*2,3. 
9 


21x2. 
7xX3x 
4.3x1 
3x6w7 




3 


>21 


6 


3 


3 


5,U 


2 


2 


2 




1: 


8 


3:2 


2:13 


1:4 


0:5 


1:3 


0:14 


0:10 


0:38 


P - 


dxysal 


hre 


ojom 






ns. 


ILJ. 


0.0022 


Its. 


Its. 




P- alive: cUad 


lU. 




ILSL 


as. 


XLS. 




fiS. 


ILS. 




SCOK 


♦ 




♦ 


(h- 


0 


♦± 


0 


0 




% aliva 


11 


60 


13 


25 


0 


25 


0 


0 


0 


% 


pfottctt 


cd 


11 


60 


U 


25 


0 


25 


0 


0 


0 


BC9739 dullcn^ 


R36A/RX1/D39 




DBL5 1 





wo 97/09994 



PCT/US96/148I9 



OC9739 


days of 
death 


median 
dayiof 
dtath 


alivi 
dcaa 


e value 
boMdon 
dajrsto 
death 


F value 
baiedan 
aUve : 
dead 


Score 


% Alive 


% 




AU immune 




3 










12 


U 




All controls 




■ i 


0:55 







I Pooled Data for Protection against Et-azge. by various PspAs \ 



Exp. 


CFU 


Mice 


Days to Death/ imxnunogen 


EF3296 


BCIOO 


P906 

Win 


IS1020 
(B09739) 


JSS01Q3H 
(DB15) 


iS3Q20 
(DBUA) 


COQllOt 


EMl 


3.99 


BALB/^ 






4x2,>14 






9x2 


E140 


r92 


CBA/N 


3«<6.>21 










3J4 


E104 


3.11 


CBA/N 








2JJ3J 


2X3.43 


2XZ3.4 


EUi 


3.94 


CBA/lsr 








uja2 


1,1XZ2 




Ein 


4.06 


CDA/N 














All 


3,4,fi^>21 


33^^^21 


4x2.4.3x5w 
6.>21 


1A^2,3J 


l.U5x2,X 
4.5 


2.3x3 
4,4x6 


median days to death 


5 


5 


4J 


2 


2 


2 


alive : dead 


1:3 


1:4 


1:9 


0:9 


0:10 


0:27 


P • dan to death 


Qjoan 


0J009i 


M. 


OS. 


OS. 




P-aUv«:de«l 


n.s. 


CLS. 


as. 


HA 




OS. 


Score 


♦t 


♦t 


Ov 


0 


0 




% alive 


25 


20 


10 


0 


0 


0 


%PfDt 


25 


20 


10 


0 


0 


0 


Beat 














EE3296 
challenge 


Rxl 
BQOO 


)D90S 

WTO 


;sio2o 

(BG9739) 


JS5010JH. 
(DBL5) 


JS3020 
(DDUA) 


oootcol 



EEI296 
challenge 


OMdian 
dayt alhre 


aUve : 
dead 


P -days 
to death 


P*aUve : 
dead 


Soort 


% 
alive 


% 


All immune 


3 


3:33 




fU. 


0 


6 


B 


All comroi 


1 


0:»" 





Pooled Data for Protection against EF5668« by various 
FL*PspAs andbdOOs 

Exp, iCFUj Mice| Days to Death/ immtinogen 





EF 
5668 




R36A 


Rxl 
BOOO 


JD90a 
(WU2) 


151020 
(BC9739) 


EF3296 


EF566B 


U1905 


1550103 

FL 
DBL5 


lS3oao 

DBL6A 


control 


£143 


2J0 


caA/h 












5x>10 








1.1.20. 
>10 


El«l 


339 


caA/K 




4^.12>21 
















2.4.6 


E171 


3^ 


OA/M 






2.X2.3. 
>2l 








3A4, 
2x>21 






IA6A7 


















U4x >i; 


3.43.6.6 


334,4.9 


£145 


3.94 


caA/N 


X4,4, 
16. >19 






2.ia 
3x >19 


2x>19 










2,XX4. 


Pool 






3.3x4,6.12.16. 
2x>21 


12.2,3. 
>21 


2.ia 

3x>21 


2.4U 
2x>21 


5x>21 


3.3.4. 
2x>21 


3.3.3X 
>21 


3.43A* 


1x1, 4xZ 
6xX3x4. 
3xd,7,9. 
2x>21 


median days alive 


6 


2 


>21 


U 


>21 


4 


>21 


5 


3 


alive : dead 


2:7 


1:4 


3:2 


2:3 


5:0 


2:3 


3:2 


0:5 


2:21 


P •days alhre 


0^13 


n^. 


0.0187 


as. 


aool 


as. 


as. 


asw 




P - alive : dead 




n.s. 


0.027 


ILS. 




1 ILS. 


0.027 


as. 




Score 


•f 


0 




0* 




Or 


♦ 


04^ 




% alivr 


22 


25 


60 


40 


100 


40 


60 


0 


9 


%pmt 


14 


18 


56 


34 


100 


34 


56 


-10 


9 




R36A/RxVD39 


WU2 


BC9739 


EF3296 


EFS668 


U19QS 


DBLS 


DBL6A 


oootfol 



wo 97/09994 



PCT/US96/14819 



Summary of protection against EF6796 





medUnDOO 


P -lim* alive 


P ftiivc v» dtad 




4:0 


100 


100 




a029 


0.029 


com rob 


0:3 


0 


0 


1 







Fooled Data for Protection against DBL6A. by various 
FL PspAs and bdOO PspAs 





CFU 

DBU 
A 


Mka 


Days to Death/ immunogen 


BQOO 
Ral 


R36A 


JD90S 

Win 


)S1Q2D 
BC9739 


bdOO 
BG9739 


EF3296 


EB66S 


U1905 
FL 


bdOO 
Ln905 


JS5010J 
DBL5 


bclOO 
DBL5 


JSIOU 
DBUA 


conUol 


£171 


2M 


CJUJh 






47A9, 
>21 










3J,7A 
>21 










2^4A6 


£152 














15. 3x 
>21 








746. 
2x>21 




a,iau, 

21 




3x3,4,3x6 


£140 


3^ 


ciAyK 


4«>2l 
























4.7.7 


£146 


4jr 






2x>21 








6, a, 9, 
10.10 


iai3, 

3x>21 






7ia;l2, 

13.U 




4x^21 


4;433,1S 


£129 


4.14 


CftA/N 








J,6A10, 
13 


















4,5,6,8>23 


Total 




















Name of Pools 


IU6A/IU1/D39 


WU2 


BC9ra9 




LB1905 


DELS 


DBUA 


conmla 


Pooled dau 


7,«,t0,6x>21 


>21 


3,6440,1345, 
3x>21 


6,6,9, 
10,10 


iai3, 

3x>21 


3,37,7A16, 
Zx>21 


7,«A10,12, 
3xU.21 


»,4x>21 


2,U3, 
6x13x5, 
6x6,7.7. 
8, IB, >21 


median daya aUvc 


>21 


%S 


13 


9 


>21 


8 


12 


>2l 


5 


aliv« : dead 


6:3 


1:3 


3:6 


0:5 


3:2 


2:6 


0:9 


4:1 


ia4 


P * days alive 






00025 




0^7 


0.002 






P * alive : dead 


0.0019 


fLl. 


0iM8 








as. 






SODte 


4"«-4- 


+± 


♦-I- 


+± 


++♦ 


♦± 


+± 


+■♦'4' 






67 


25 


33 


0 


60 


25 


0 


80 


4 




66 


22 


30 


-4 


SB 


22 


-4 


79 


0 


DBUA challcnfct 


R56A/IU1/D39 


WU2 


BC9739 




U1905 


DELS 





DBL6A 
chaltcAgt 



Air 



dap of 
dtath 



AH 



median 
days of 
death 



12.5 



Alhre : 
dead 



19:35 



? — TTir 



P value 
based on dap 
to death 



<0J)001 



P value 
baaadon 
aUve tdced 



00019 



Scote 



% 
alWc 



35 



% 
prot. 



33 



Fooled Data for Protection against BG9163 by 
various PspAs 



Exp. 


CFU 

BG9163 


Mice 


Days to Death/ inimunogen 


Rxl 




jSI020 
(BG9739) 


all InuBttiM 




£169 


2.67 


CBA/N 


5x>24 








4^33^24 


£140 


3.14 


CBA/N 


n.v. 










£129 


4i) 


CBA/N 






U,4x>23 




7,9A13, 
>23 






6,3x>2l 






5.6.8,10 


bisnunofflBv 


RX1/R36A/D39 






Pooled Dau 


6,Bx>21 


12,4x>21 


6,12,12x>21 


44,743A 
9.12Jtx>21 


median dan alive • 


>21 


>2l 


>21 


8.5 


alhre : dead 


8:1 


4:1 


12:2 


2:8 


P -davs alive 


a0086 


0LO097 


0i)027 




P- alive: dud 


0.0045 


0047 


0.0022 




% alive 


89 


80 


86 


20 


%prot 


86 


75 


83 


0 


SCOfV 










E0163 ChalUaea | IUVR36A/D39 


BC9739 


all Intxnune 





WO97/099W PCT/US96/14819 







median 


alive : 


P value 


f value 


Score 


% 


% 


EF5668 


death 


dayiof 


dead 


based on dayi 


baMdon 




alive 


prot 






death 




to death 


allvt :dead 






AUtoununc 




8 


1S:26 




OOGS 


++ 


41 


35 

















Pooled Data for Protection against L81905. by various FL-PspAs 



Exp. 


CFU 

L8190S 


Mice 






— — " J— 1 

Days to Death/ immunogcn | 


K36A 1 BClOO 1 JOsdT" 
(1U1) (WU2) 


jsuao 

rB09739y 


309739 


EF3296 


£F5668 


IJ19DS 


JS5010J 
(DBl^ 


bdOO 


153020 1 <acittt4 | 
rOBUA)! 1 




3.4J(M 




















i:^r!iBK¥rarinT:vy7 


























£084 




BALB 








2.2.Sa 
>14 


















£104 




CBA/N 








3^Aaji 










X3Ja> 

>22 




3X53*6 


X4. 4,4.5 


£124 


-3^ 


CBA/N 






















2A233 


IXIXI 


feiiT 


3.6 


CfiA/N 
















3,4.6^ 




4333 






£144 


4.11 


CBA/N 


>to 










>10 














Alt 






>21 
5 


5 


3,4^^^ 
5 


>21 


7 


>10 
6 


2A3J^ 




4*2.4x3. 
2«>21 


4333 


3x2-3,X 
4. 

3;xS,6 


UU20V2 
tKXfa4. 
4x5 








1:4 


0:5 


0:5 


5:7 


0:4 


2:3 


3 

0:5 


5 
0:4 


3 
2:S 


5 
0:4 


33 
0:10 


2 
0:40 





























































































bclOOs & FL-PspAs pooled together 


Exp. 


CFU 

LB19Q5 


Mice 






Days to Death/ immunogen . | 


U&A 


BClOO 
(Rxl) 


JO90» 

(WU2) 


151020 
(BG9739) 


b^lDO 
BQ9739 


EF3296 


Ef5666 


bclOO 
L8190S 


jS5010L3 
(DBL5> 


bcUO 
COBL5) 


153020 
(DBL6A) 


ceotrol 


E172 


Z45 








3^33 


















33A4.4 


£140 


3.11 


CBA/N 




23333 




















7.7.7.1.1 


E0&4 


336 


BALB 








2,2.5x 
>14- 
















Ulx2 


£104 


-3.5 


CfiA/N 








3p^3All 










333.2X 
>22 




3,4333 


2.4,4.4.5 


£124 
£125 


-3.5 
3^ 


COA/N 
CBA/N 










5333 












2A233 


1.2.2.2.2 


£144 


4.11 


CBA/N 


333A 
>10 










433. 2x 
>I0 


U333 


3333 




4333 












ZA3.3xS,6^3>21 


3.4333 


2.233A73X Ml. 
Sx>21 


63A2X 
>10 


W333 


3,433 


4x2.4x3 

2X3 


1.4333. 
►21 


3x^3,3,4. 
3xSw6 


1.1,20x2 
Ix3l4x4. 
4x5 




5 


5 


8 


6 


3 


5 


3 


33 


2 


alive: dead 


1:9 


0:5 


5:11 


2:3 


0:5 


0:4 


2:12 


0:10 


0:40 


? - days alive 




0.0035 


<OlOO01 


0.0002 


as. 


0J)1 


0035 


0044 




P - alive: dead - 




n-s. 


03001 


Oi)l 












score 




-f 






0 




-¥ 


♦ 




% alive 


10 


0 


31 


40 


0 


0 


14 


0 


0 


% protected 




















fchallenge with L81905 


R36A/RX1/D39 


WU2 


BC29739 




DBLS 


DBI3A 


conttole 



wo 97/09994 



PCT/US96/14819 





daysol 
death 


III 


Alive : 
dead 


P value 
based on dayi 
to death 


P value 
baaed on 
allvt :dcad 


Score 


% 
alive 


% 

prot 


All immune 




5 


10:59 


<ojm\ 


0008 




14 


14 


All control 




"■ i 







Pooled Data for Protection against DBL5 by various 
FLrPspAs febdOOs 



Exp 


CFL 
DBL5 


Mice 


Days to Death/ immunogen 


R36A 


eaoo 

Rxl 


JSUOO 
B09739 


bdOO 
151020 


EF5666 


bdOO 
LS1905 


)SS0UL3 
DBLS 


bclOO 
DBLS 


JS3Q20 
DBUA 


cootiol 








6x2 














»x2 


EWO" 




CBA/N 




S 
















2A2 


£104 


339 


Xid 






30,6. 
>22,>22 








>22,>22 






2r4x3 


£124 


lJ?i 


Xtd 














ts 




5x2 


U*2A2 


£125 


3Sl 


CBA/N 
















2A2>21 




Sx2,S 




4.13 


XID 


>10 








4 










5x2 




tota 


1 






















name of pool 


iU6A/RjiyD3S 


BG9739 


\miiiii\SSS 


DBLS 


DBL6A 




pooled data 


4x3,2x4,3x5, 
>21 


6x2.4x3,4,5. 
>21, >2l 


2J13AX 




6x2*5,7,7,15, 
4x>21 


7x^4,5 
5 


1*1,26x2, 
4x3,5 


median days alive 


4 


3 


3 


3J 


6 


2 


2 


alive : dead 


1:9 


2:U 


0:4 


0:4 


4:10 


0:10 


0:32 


P - days alive 


<OJ00O\ 


0L0063 


Ml 


0001 


0JJO25 


nj. 




P-aUve:dad 


A3. 


as. 


O-S. 


tLf. 


a0056 


njw 




Score 


♦ 


♦ 


+• 


*- 


♦4- 


0 




% aiivtt 


10 


14 


0 


0 


29 


0 


0 




10 


14 


0 


0 


29 


0 


0 


1 DBL5 challenge 


RJ6Amxt/D3S 


BCf739 


EF5668 


Ltl9IB 


DBLS 


DBL6A 


contiola 



^ Thlt fanr*"nT**H^ •i*t»«>t i>«pA Note BAtB/c| mke «rare used. Also note 10* Challenge CFU. 



DBLS 
challenge 


days of 


median 


alive : 


P value 


Pvahie 


Score 


% 


% 


death 


days of 


dead 


based on days 


based on 




alive 


prot 




death 




to death 


alive :dcad 








All immune 




3J 


7:49 


kQJXOI 


0.034 




3.6 




All conttot 




J 


0:33 





Pooled Data for Protection against EF6796 by various FspAs | 



Exp« 



CPU 

WU2 



Mice 



Days to Death/ inununogen 



Rxl 
BClOO 



JS1020 
(BC9739) 



U190S 



jS3020 1550103 FL 0BL5 



(DBUA) 



(DBLS) 



bdOO 



E140 



3.75 



CBA/N 



4x>21 



1,1,1 



£28 



BALE n.v 



Pool of Pools for protection agaisnt EF6796 


line 


Group 
Description 


Delay in time to death and/or survival 


Trotection against death 


days to death (medain DOD) 


P values etc 


«ltye:d«ad 


P values etc 


la 


Rxl 


4x>2J (>2l) 


0l029 


4:0 


0.029 


lb 


Rxl controb 


U.1 (1) 




0:3 





llOD 



wo 97/09994 



PCT/US96/14819 



Pooled Data for Protection against BG7322. by various FL- PspAs and bclOOs 


Exp. 


CFL 
8C 


Mice 


Days to Death/ immunogen 


R36A 


iUl 
BOOO 


JDMB 
(WU2) 


bdOO 


EF3296 


EFSMt 
• 


MOO 


ISSOIQJ 
D8L5 


bcIOO 
DBL5 


ISJ020 
DBL6A 


cootrol 


£171 




CBA/^ 






10,15. 
3ji>21 
















1,3,6,6,7 


5143 


xo 


CBA/^ 












7,ea>10 










8J 


£140 
BOOO 


X14 


CBA/h 




4ji>21 


















).6,^>21 


EISZ 


Xl\ 


CBA/h 








aui6« 

>21 






>2I 




>2t >2t 
>21. >2t 




^7,7,8,01 


Q46 


157 


CBA/h 


3ar>21 








\3a«,10 






WDLll, 

n.i9 




i^flwll. 
iaw>2t 


4.5,5,6. 
>21 
























lnvnuno|BBns 


R36A/IUI/D39 




OBU 


OBUA 


Cant 


J^Dols 




lais, 

Sk>21 


1X1X16. 
>2t 


Sw5a«wlO 


7,t»>2l 


iau.13. 

>2l 


6.1041. 
1U9 
>21.>2t>2t>21 


ta.it 
ia.>2i 


U3S2.XX 
44. 9a 5, 

«»7,4«1 
9,14, 
2k>21 


median d*Y alive 


>21 


>21 


14J 


6 


>21 


12J 


>21 


11 


6 


alive : dead 


9:0 


3:2 


1:3 


0:S 


B:l 


1:3 


4:5 


1:3 


2:32 


F -dayt alive 


<0i)001 


0-0007 


aool 


fU. 


<0.000l 


aooi3 


0.0002 






P - alive : dead 


<0i}001 


0004 




iLa. 


<OX)001 


oa. 


0J)076 






% alive 


100 


60 


25 


0 


89 


25 


80 


25 


6 


% protccttd 


100 


57 


22 


0 


88 


22 


79 


22 


6 


ScDse 


■»»♦ 




»± 


0 




♦t 




♦± 




3G7323 ChaUeng« 




DBU 




1 CmL 



BC7322 
ChLlUn;e 


median dayad 
death 


alive : 
dead 


P value 
based OA days 
to death 


Pvahtc* 
baaed on 
aUvc :dcad 


Scocv 


% 

aUve 


% 

prot 


AUiaumme 


>21 


30:25 


<aoooi 


<OJ0OD\ 




55 


52 


AUcontrob 


6 







ItoP 



wo 97/09994 



PCr/US96/14819 



EXAMPLE 8 - Ability of PspA immunogens to protect against 
individual challenge strains 

In example 7 some of the capsular type 2, 4, and 5 
strains were not completely protected from death by immunization. 
In these studies the BALB/cByJ mouse was used instead of the 
hypersusceptible, immunodef icient CBA/N mouse used for the 
Example 7 studies. With the BALB/cJ mouse it was observed that 
immunization with PspA was in fact able to pjrote^t against death 
with capsular type 2, A, and 5 pneumococci. This result is shown 
in the table below. 

The data from Table 60A also demonstrates that a 
mixture of 4 - 5 full length PspAs was as effective, or more 
effective than immunization with a single PspA. 
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BXAMpLE 9 - CHARACTERIZATION OP PspA EPITOPES WITHIN 

PNEUMOCOCCAL STRAINS MC2S-2e 

V — ' — 

The strains examined came from a group of 13 capsular 
serotype 6B strains which have been identified that are members 
of a multiresistant clone, having resistance to penicillin, 
Chloramphenicol, tetracycline, and some have acquired resistance 
to erythromycin. The pneumococcal isolates described in the 
following studies (MC25-28) are members of this 6B clone. 
Although previously thought to be geographically restricted to 
Spain (unlike the widespread multiresistant Spanish serotype 23F 
clone) , members of this clone have been shown to be responsible 
for an increase in resistance to penicillin in Iceland (Soares, 
S., et al., J. Infect. Dis. 1993; 168 ; 158-163). 

The following techniques were used to characterize the 
location of difference PspA epitopes: 

Bacterial cell culture. Bacteria were grown in Todd-Hewitt broth 
with 0.5% yeast extract or on blood agar plates overnight at 37«c 
in a candle jar. Capsular serotype was confirmed by cell 
agglutination using Danish antisera (Statens Seruminstitut, 
Copenhagen, Denmark). The isolates were subtyped as 6B by 
Quellung reaction, utilizing rabbit antisera against 6A or 6B 
capsule antigen. 

Bacterial lysates. Cell lysates were prepared by incubating the 
bacterial cell pellet with 0.1% sodium deoxycholate, 0.01% sodium 
dodecylsufate (SDS) , and 0.15 M sodium citrate, and then diluting 
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the lysate in 0»5M Trls hydrochloride (pH 6-8). Total 
pneumococcal protein in the lysates was quantitated by the 
bicinchoninic acid method (BCA Protein Assay Reagent; Pierce 
Chemical Company, Rockford, IL) . 

PspA sBratyping^ Pneumococcal cell lysates were subjected to 
SDS-PAGE, transferred to nitrocellulose membranes, and developed 
as Western blots using a panel of seven MAbs to PspA. PspA 
serotypes were assigned based on the particular combination of 
MAbs with which each PspA was reactive. 

Colony immunoblotting . A ten mL tube of Todd-Hewitt broth with 
0.5% yeast extract was inoculated with overnight growth of MC25 
from a blood agar plate. The isolate was allowed to grow to a 
concentration of lo'' cells/mL as determined by an CD. of 0.07 at 
590nm. MC25 was serially diluted and spread-plated on blood agar 
plates to give approximately 100 cells per plate. The plates 
were allowed to grow overnight in a candle jar, and a single 
blood agar plate with well-defined colonies was selected. Four 
nitrocellulose membranes were consecutively placed on the plate. 
Each membrane was lightly weighted and left in place for 5 min. 
In order to investigate the possibility of phase-variation 
between the two proteins detected on Western blots a single 
colony was picked from the plate, resuspended in ringer's 
solution, and spreadplated onto a blood agar plate. The 
membranes were developed as Western blots according to PspA 
serotyping methods. 
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When the strains MC25-28 were examined with the panel 
of seven MAbs specific for different PspA epitopes, all four 
demonstrated the same patterns of reactivity (Fig. 14) . The MAbs 
XiR278 and 2A4 detected a PspA molecule with an apparent 
molecular weight of 190 kDa in each isolate. In accordance with 
the PspA serotyping system, the 190 kDa molecule was designated 
as PspA type 6 because of its reactivity with XiR278 and 2A4, but 
none of the five other MAbs in the typing system:" Each isolate 
also produced a second PspA molecule with an apparent molecular 
weight of 82 kDa. The 82 kDa PspA of each isolate was detected 
only with the MAb 7D2 and was designated as type 34. No 
reactivity was detected with MAbs Xil26, Xi64, 1A4 , or SR4Wr. 
Results from the colony imrounoblotting showed that both PspAs 
were present simultaneously in these isolates under in vitro 
growth conditions. All colonies on the plate, as well as all of 
the progeny form a single colony, reacted with MAbs XiR278, 2A4, 
and 7D2. 

EXAMPI^E 10 - SOOTHERN BLOT ANALYSIS OF CHROMOSOMAL DNA ISOLATED 
FROM PNEOMOCOCCAL STRAINS MC25-28 

Pneumococcal chromosomal DNA was prepared by the 

Youderian method (Sheffield, J.s., et al., Biotechniques, 1992; 

12.1 836-839). Briefly, for a 500 ml culture in THY or THY with 

1% choline, cells were centrifuged at 8000 rpm in GSA rotor for 

30 minutes at 4»C. The supernatant was decanted, and the cells 

were washed with 1 to 2 volumes of sterile water to remove 
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choline, if used. This step was only necessary when sodium 
deoxycholate was used. The wasted cells were centrifued twice a 
8000 rpm in GSA rotor for 10 minutes. Cells were resuspended in 
3,5 ml TE buffer, containing 1% SDS or 1% sodium deoxycholate, 
and incubated at 37«>C for 15 minutes if sodium deoxycholate was 
used. If SDS was used, incubation at 37 was not necessary. 
The cells were incubated at 65«C for 15 minutes, and 1/5 volume 
of 5.0 M potassium acetate was added, and the cell suspension was 
incubated for 30 minutes at 65*^0. 

The cells were placed on ice for 60 minutes, and 
centrifuged at 12,000 rpm in an SS-34 rotor for 10 minutes. The 
supernatant was transferred to a clean centrifuge tube, and 2 
volumes of cold 95% ethanol was added. After mixing, DNA was 
spooled on to a glass pasteur pipet, and air dried. The DNA was 
resuspended in 4 ml TE, and 4.0 g cesium chloride was added. The 
solution was split into two aliquots in ultracentrifuge tubes, 
and the tubes were filled to their maximum capacity using 
1.0 g/ml cesium chloride in TE. Before closing the tubes, 300 ml 
of 10 ug/ml ethidium bromide was added. 

The solution was centrifuged at 45,000 rpm overnight, 
or for 6 hours at 55,000 rpm. The chromosomal band was extracted 
using a gradient, at least 6 times with 1 volume each salt- 
saturated isopropanol. The aqueous phase was extracted by adding 
2 volumes 95% ethanol. The DNA came out of solution immediately, 
and it was spooled on to a pasteur pipet. The DNA pellet was 
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washed by dipping the spooled DNA in 5 ml 70% ethanol. The DNA 
was air dried, and resuspended in the desired volume of TE, e.g., 
500 ul. 

The cells were harvested, washed, lysed, and digested 
with 0.5% (st/vol) SDS and lOO/xg/mL proteinase K at 37«C for 1 h. 
The cell wall debris, proteins, and polysaccharides were 
complexed with 1% hexadecyl trimethyl ammonium bromide (CTAB) and 
0.7M sodium Chloride at SS-C for 20 min., and then extracted with 
chloroform/ isoamyl alcohol. DNA was precipitated with 0.6 
volumes isopropanol, washed, and resuspended in lOmM Tris-HCl, 
ImM EDTA, pH 8.0. DNA concentration was determined by 
spectrophotometric analysis at 260 nm (Meade, H.M. et al., j. 
Bacterid 1982; 149: 114-122; Silhavy, T.J. et al., ExDeri.n.ni-« 
wi,th Gene Fusion , Cold Spring Harbor: Cold Spring Harbor 
Laboratory, 1984; and Murray, M.G. , et al., Nucleic Acids Res. 
1980; 8 4321-4325) . 

PrabB preparation. 5' and 3' oligonucleotide primers homologous 
with nucleotides to 26 and 1967 to 1990 of Rxl pspA (LSM13 and 
LSM2, respectively) were used to amplify the full length pspA and 
construct probe LSMpspA13/2 from Rxl genomic DNA. 5' and 3' 
oligonucleotide primers homologous to nucleotides 161 to 187 and 
nucleotides 1093 to 1117 (LSM12 and LSM6, respectively) were used 
to amplify the variable a-helical region to construct probe 
LSMpspA12/6. PGR generated DNA was purified by Gene Clean 
fBiolOl Inc., Vista, CA) and random prime-labeled with 
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digoxigenin-ll-dUTP using the Genius l Nonradioactive DNA 
Labeling and Detection Kit as described by the manufacturer 
(Boehringer Mannheim, Indianapolis, IN) . 

DIM electrophoresis. For Southern blot analysis, approximately 
lO^xg of chromosomal DNA was digested to completion with a single 
restriction endonuclease (Hind III, Kpn 1, EcoRI, Dra I, or Pst 
I), then electrophoresed on a 0.7% agarose gel for 16-48 h at 35 
volts. For PGR analysis, 5>L of product were incubated with a 
single restriction endonuclease (Bel 1, BamH I, Bst I, Pst I, Sac 
I, EcoR I, Sma I, and Kpn I), then electrophoresed on a 1.3% 
agarose gel for 2-3 h at 90 volts. In both cases, 1 kb DNA 
ladder was used for molecular weight markers (BRL, Gaithersburg, 
MD), and gels were stained with ethidium bromide for 10 min and 
photographed with a ruler. 

Southern blot hybridization. The DNA in the gel was depurinated 
in 0.25N HCl for 10 min, denatured in 0.5M NaOH and 1.5M NaCl for 
30 min, and neutralized in 0.5M Tris-HCl (pH 7.2), 1.5M NaCl and 
imM disodium EDTA for 30 min. DNA was transferred to a nylon 
membrane (Micron Separations INC, MA) using a POSIBLOT pressure 
blotter (Stratagene, LaJolla, CA) for 45 min and fixed by UV 
irradiation. The membranes were prehybridized for 3 h at 42«»C in 
50% formamide, 5X SSC, 5X Denhardt solution, 25mM sodium 
phosphate (pH 6.5), 0.5% SDS, 3% (wt/vol) dextran sulfate and 
500Mg/mL of denatured salmon sperm DNA. The membranes were then 
hybridized at 42 »c for 18 h in a solution containing 45% 
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fonnamide, 5X SSC, ix Denhardt solution, 20inM sodium phosphate 
(PH 6.5), 0.5% SDS, 3% dextran sulfate, 250/iig/inL denatured 
sheared salmon sperm DNA, and about 20ng of heat-denatured 
digoxigenin-labeled probe DNA. After hybridization, the 
membranes were washed twice in 0.1% SDS and 2X SSC for 3 min at 
room temperature. The membranes were washed twice to a final 
stringency of 0.1% SDS in 0.3X SSC at 65 "C for 15 min. This 
procedure yielded a stringency greater than 95 percent. The 
membranes were developed using the Genius l Nonradioactive DNA 
Labeling and Detection Kit as described by the manufacturer 
(Boehringer Mannheim, Indianapolis, IN). To perform additional 
hybridization with other probes, the membranes were stripped in 
0.2N NaOH/0.1% SDS at 40oc for 30 min and then washed twice in 2X 
SSC. 

PCR. 5' and 3' primers homologous with the DNA encoding the N- 
and C-terminal ends of PspA (LSM13 and LSM2, respectively) were 
used. Reactions were conducted in 50/iL volumes containing 0.2mM 
of each dNTP, and l^L of each primer at a working concentration 
of 50mM. MgCl2 was used at an optimal concentration of l.75mM 
with 0.25 units of Tag DNA polymerase. Ten to thirty ng of 
genomic DNA was added to each reaction tube. The amplification 
reactions were performed in a thermal cycler (M.J. Research, 
Inc.) using the following three step program: Step l consisted 
of a denaturing temperature of 940c for 2 min; step 2 consisted 
of 9 complete cycles of a denaturing temperature of 940c for 1 
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inin, an annealing temperature of 50*>C for 2 min, and an extension 
temperature of 72 for 3 min; step 3 cycled for 19 times with a 
denaturing temperature 94«>C for l min, an annealing temperature 
of 60**C for 2 min, and an extension temperature of 72 •C for 3 
min; and at the end of the last cycle, the samples were held at 
72 for 5 min to ensure complete extension. 

Band size estimation. Fragment sizes in the molecular weight 
standard and in the Southern blot hybridization patterns were 
calculated from migration distances. The standard molecular 
sizes were fitted to a logarithmic regression model using Cricket 
Graph (Cricket Software, Malvern, PA). The molecular weights of 
the detected bands were estimated by entering the logarithmic 
line equation obtained by Cricket Graph into Microsoft Excel 
(Microsoft Corporation, Redmond, WA) in order to calculate 
molecular weights based on migration distances observed in the 
Southern blot. 

Since most strains contain a pspA gene and a pspC gene, 
it was expected that if an extra gene were present one might 
observe at least three pspA homologous loci in isolates MC25-28. 
In Hind III digests of MC25-28 each strain revealed 7.7 and 3.6 
kb bands when probed with LSMpspAl3/2 (Figure 15A and 15C) . in 
comparison, when Rxl DNA was digested with Hind III and 
hybridized with LSMpspA13/2, homologous sequences were detected 
on 9.1 and 4.2 kb fragments, as expected from previous studies 
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with PspA (Figure 15A) . Results consistent with two pspA- 
homologous genes in MC25-28 were obtained with two pspA- 
homologous genes in MC25-28 digested using four additional 
enzymes (Table 61) . 

Table 61. Chromo8on.al RFL Ps with probe LSMpspA13/2 for isolates MC25-28 and 

?nzS^*"'^°" Examined Restriction Fragments 

-— (sizes in kilobases) 

MC25 MC26 MC27 MC2B RXl MC25-2B rxI 

Hind III + + * . ^ 3 g ^ ^ 

'^P" ^ * * * * * 11.6, 10.6 10.6, 9.8 

EcoR 1 + * a A -y ^ 

+ 8.4, 7.6 7.8, 6.6 

* 2.1, 1.1 1.9, 0.9 

^ " >14> 6.1 10.0, 4.0 

The four isolates examined are all members of a single 
clone of capsular type 6B pneumococci isolated from Spain. These 
four isolates are the first in which two PspAs have been 
observed, i.e., PspA and PspC, based on the observation that 
bands of different molecular weights were detected by different 
MAbs to PspA. Mutation and immunochemistry studies have 
demonstrated that all of the different sized PspA bands from Rxl 
are made of a single gene capable of encoding a 69kDa protein, 
supporting the assertion that two PspAs have been observed, i.e., 
PspA and PspC. 

It has been observed that probes for the 5' half of 
pspA (encoding the a-helical half of the protein) bind the pspC 
sequence of most strains only at a stringency of around 90%. 
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With chromosomal digests of MC25-28, it was observed that the 5' 
Rxl probe hSMpspA12/6 (Figure 15D) bound two pspA homologous 
bands at even higher stringency. The same probe bound only the 
pspA containing fragment of Rxl at the higher stringency (Figure 
15B) . 

Further characterization of the pspA gene was done by 
RFLP analysis of PGR amplified pspA from each strain. Since 
previous studies indicated^ that individual strains yielded only 
one product, and since the amplification was conducted with 
primers based on a known pspA sequence, it was assumed that the 
product amplified from each strain represented the pspA rather 
than the pspC gene. When MC25-28 were subjected to this 
procedure, an amplified pspA product of 2.1 kb was obtained from 
each of the four strains. When digested with Hha I, this 
fragment yielded bands of l.l, 0.46, 0.21 and 0.19 kb for each of 
the four isolates. A single isolate, MC25, was analyzed with 
eight additional enzymes. Using each restriction enzyme, the sum 
of the fragments was always approximately equal to the size of 
whole pspA (Figure 16). These results suggested that the 2.1 kb 
amplified DNA represents the amplified product of only a single 
pspA gene. Rxl produced an amplified product of 2.0 kb and five 
fragments of 0.76. 0.468, 0.390, 0.349 and 0.120 kb when digested 
with Hha I as expected from its known pspA sequence. 

There are several possible explanations for the 
observation of PspA and PspC in these strains but not in other 
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strains. All isolates might make PspA and Pspc in culture, but 
HAbs generally recognize only PspA (perhaps, in this isolate 
there has been a recombination between pspC DNA and the pspC 
locus, allowing that locus to make a product detected by MAb to 
PSPA) . All isolates can have PspA and PspC, but the expression 
of one of them generally does not occur under in vitro growth 
conditions. The pspC locus is normally a nonfunctional 
pseudogene sequence that, for an unexplained reason, has become 
functional in these isolates. Results from the colony 
imnunoblotting of these isolates failed to show a detectable in 
vitro phase shift between either PspA type 6 (XIR278 and 2A4) or 
PSPA type 34 (7D2) protein. This strengthens the second 
explanation, and suggests that the second PspA is these isolates 
is due to the pspc gene not being turned off during in vitro 
growth conditions. 

Presumably, in these four strains, the second PspA 
protein is provided by the pspC DNA sequence. At high 
stringency, the probe comprising the coding. region of the a- 
helical half of PspA recognized both pspA homologous sequences of 
MC25-18, but not the pspC sequence of Rxl. The finding indicated 
that the pspc sequence of MC25-28 is more similar to the Rxl pspA 
sequence than the Rxl pspC sequence. if the pspC sequence of 
these strains is more similar to pspA than most pspc sequences, 
it could explain why the products of pspC genes cannot generally 
be identified by MAbs. 
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Example 11 - Identification of conserved and variable regions 

of dsdA and pspc sequence s of s. pneumoniap 

The S. pneumoniae strains used in this study are listed 
in Table 62. The strains are human clinical isolates 
representing 12 capsular and 9 PspA serotypes. All strains were 
grown at 37 -C in lOOml of Todd-Hewitt broth supplemented with 
0.5% yeast extract to an approximate density of 5 x lo^ cells/ml. 
After harvesting of the cells be cehtrifugation (2900 g, lOmin) , 
the DNA was isolated, and stored at 4»C in TE (10 mM Tris, imM 
EDTA, pH8.0) . 
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Table 62. Streptococcus pneumoniae stra 



ins used. 



S trai n 

WU2 
D39 
R36A 

Rxl 
DBL5 
DBL6A 
A66 
AC94 
AC17 
AC40 
AC107 
AClOO 
AC140 
D109-1B 
BG9709 

L81905 
L82233 
L820O6 



Relevant phenotyp e 

Capsular type 3, PspA type 1 

Capsular type 2, PepA type 25 

Nonencapsuated mutant of D39, 
pspA type 25 

Derivatitve of R36A, PspA type 25 

Capsular type 5, PspA type 33 

Capsular type 6A, PspA type 19 

Capsular type 3, PspA type* 13, 

Capsular type 9L, PspA type 0 

Capsular type 9L, PspA type 0 

Capsular type 9L, PspA type 0 

Capsular type 9V, PspA type 0 

Capsular type 9V, PspA type 0 

Capsular type 9N, PspA type 18 

Capsular type 23, PspA type 12 

Capsular type 9, PspA type 0 

Capsular type 4, PspA type 25 
Capsular type 14, PspA type 0 
Capsular type 1, PapA type 0 



Reference 

Briles et al., 1981 
Avery et al,, 1944 
Avery et al., 1944 

Shoemaker and Guild, 1974 
Yother et al., 1986 
Yother et al., 1986 
Avery et al., 1944 
Waltroan et al., 1992 
Waltman et al., 1992 
Waltman et al., 1992 
Waltman et al., 1992 
Waltman et al., 1992 
Waltman et al., 1992 
McDaniel et al., 1992 
McDaniel et al., 1992 

McDaniel et al . , 1992 
McDaniel et al., 1992 
McDaniel et al., 1992 



Approximately S^g of chromosomal DNA was digested with 
Hindlli according to the manufacturer's instructions (Promega, 
inc. , Madison, WI) . The digested DNA was subjected to 
electrophoresis at 35 »v overnight in 0.8% agarose gels and then 
vacuum-blotted onto Nytran® membranes (Schleicher & Schuell, 
Keene, NH) . 
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The oligonucleotides uses were based on the previously 
determined sequence of Rxl pspk. Their position and orientation 
relative to the structural domains of Rxl pspA are shown in 
Figure 17. Labeling of oligonucleotides and detection of probe- 
target hybrids were both performed with the Genius System® 
according to manufacturer's instructions (Boehringer-Mannhein, 
Indianapolis, IN) . All hybridizations were done for 18 hours at 
42*C without formamide. By assuming that 1% base-pair 
mismatching results in a l°c decrease in T^ arbitrary 
designations of "high" and "low" stringency were defined by salt 
concentration and temperature of post-hybridization washes. 
Homology between probe and target sequences was derived using 
calculated T^ by established methods. High stringency is defined 
as ^ 90%, and low stringency is < 85% base-pair matching. 

PGR primers, which were also used as oligonucleotide 
probes in Southern blotting and hybridizations, were designed 
based on the sequence of pspA from pneumococcal strain Rxl. 
These oligonucleotides were synthesized by Oligos, Etc. (Wilson, 
OR) , and are listed in Table 63 . 
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Table 63. Oligonucleotide sequences. 



Primer -> 3* 



LSMlll 


CCGCATCCAGCTCCTGCACCAAAAC 


LSM2 


GCGCGTCGACGCTTAAACCCATTCACCATTGG 


LSM3 


CCGGATCCTGAGCCAGAGCAGTTGGCTG 


hSM 


CCGGATCCGCTCAAAGAGATTGATGAGTCTG 


LSM5 


GCGGATCCCGTAGCCAGTCAGtCTAAAGCTG 


LSH6 


CTGAGTCGACTCGAGTTTCTGGAGCTGGAGC 


VSH7 


CCGGATCCAGCTCCAGCTCCAGAAACTCCAG 


LSM9 


GTTTTTGGTGCAGGAGCTGG 


LSMIO 


GCTATGGCTACAGGTTG 


LSM12 


CCGGATCCAGCGTGCCTATCTTAGGGGCTGGT 


LSM112 


GCGGATCCTTGACCAATARRRACGGAGGAGGC 



PGR was done with an MJ Research, Inc., Programmable 
Thermal Cycler (Watertown, MA) , using approximately lO ng of 
genomic pneumococcal DNA as template with designated 5' and 3' 
primer pairs. The sample was brought to a total volume of 50 /il 
containing a final concentration of 50 mM KCl, lo mM Tris-HCl (pH 
8.3), l.SmM MgClj, 0.01% gelatin, 0.5 of each primer, 200 /xM 
of each deoxynucleoside triphosphate, and 2.5 U of Tag DNA 
polymerase. The samples were denatured at 94*>C for 2 minutes and 
subjected to 10 cycles consisting of: i min at 94°C, 2 min at 
50^C, and 3 min at 72°C, followed by 20 cycles of: 1 min at 9A°c, 
2 min at 60°C, and 3 min at 72°C. After 30 total cycles, the 
samples were held at 72°C for an additional 5 min prior to 
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cooling to 4°C. The amplicons were then analyzed by agarose gel 
electrophoresis . 

Oligonucleosides were used to probe Hindlll digests of 
DNA from 18 strains of s. pneumoniae under conditions of low and 
high stringency. Each strain was also screened using a full- 
length pspA probe. Table 64 summarizes the results for each 
strain under conditions of high stringency. Strain Rxl is a 
laboratory derivative of the clinical isolate D39 and 
consequently, both strains showed identical hybridization 
patterns and are a single column in Table 64. 
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The only strain which did not have more than one pspA- 
homologous Hindlll fragment was WU2, which was previously shown 
using a full-length pspA probe. Even at high stringency, six of 
the eight probes detected more than one fragment in at least one 
of the 18 strains (Table 64). LSM7, lo and 12 hybridized with 
two fragments in more than one-half of the strains, and the 
fragments detected by the oligonucleotide probes were identical 
in size to those detected by the full-length pspA probe. 
Moreover, the same pairs of fragments were frequently detected by 
probes derived from the 3' as well as the 5' region of Rxl pspA. 
These results suggested that the Hindlll fragments from different 
isolates include two separate but homologous sequences, rather 
than fragments of a single pspA gene. Based on the diversity of 
the hybridization patterns and the size of restriction fragments, 
it is clear that pspA and pspC sequences are highly diverse and 
that these loci have considerable sequence variability as 
determined by location of Hindlll recognition sites. 

Oligonucleotides which hybridize with a single 
restriction fragments in each strain were assumed to be specific 
for pspA. At high stringency, LSM3 and LSM4 detected only a 
single Hindlll fragment in the strains with which they reacted. 
Restriction fragments containing homology to LSM3 or LSM4 were 
the same as those which hybridize with all of the other 
homologous probes. This suggested that LSM3 and LSM4 
specifically detect pspA rather than the pspC sequence. That 
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LSM3 hybridizes with a single restriction fragment of WU2 further 
confirmed that this oligonucleotide is specific for pspA. 
Sequences from the portion of the gene encoding the second 
proline region (LSMlll) and the C-terminus (LSM2) appeared to be 
relatively specific for pspA since they generally detect only one 
of the Hindlll fragments of each strain. 

Oligonucleotides LSM12 and LSMIO were able to detect 
the most conserved epitopes of pspA and generally hybridize with 
multiple restriction fragments of each strain (Table 65) . LSM7 
was not as broadly cross-reactive, but detected two pspAs in 41% 
of strains including almost 60% of the strains with which it 
reacts. Thus, sequences representing the leader, first proline 
region, and the repeat region appear to be relatively conserved 
not only within pspA but between the pspA and pspC sequences. 
LSM3, 4, and 5 hybridize with the smallest number of strains of 
any oligonucleotides (29-35 percent) , suggesting that the a- 
helical domain is the least conserved region within pspA. In 
strains BG58C and L81905 oligonucleotides detect more than two 
Hindlll fragments containing sequences with homology to pspA. 
Because of the absence of ifindlll restriction sites within any of 
the oligonucleotides it was unlikely that these multiple 
fragments result from the digestion of chromosomal DNA within the 
target regions. Also, the additional restriction fragments were 
detected at high stringency by more than one oligonucleotide. 
Possibly, in these two strains, there are three or four sequences 
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with DNA homology to some portions of pspA. The probes most 
consistently reactive with these additional sequences are those 
for the leader, the alpha-helical region, and the proline-rich 
region. 

The oligonucleotides used as hybridization probes were 
also tested for their utility as primers in the polymerase chain 
reaction (PGR). Amplification of pspA from 14 strains of s. 
pneumoniae comprising 12 different capsular types was attempted 
with the primers listed in Table 63. LSM2, derived from the 3' 
end of pspA, were able to amplify an apparent pspA sequence from 
each of 14 pneumococcal strains when used in combination with 
LSMlll, which is within the sequence of pspA encoding the 
proline-rich region. Combinations of LSM2 with primers upstream 
in pspA were variably successful in amplifying sequences (Table 
65) . The lowest frequency of amplification was observed with 
LSM112 which was derived from the Rxl sequence 5' to the pspA 
start site. This oligonucleotide was not used in the 
hybridization studies. DNA fragments generated by PGR were 
blotted and hybridized with a full-length pspA probe to confirm 
homology to pspA. 

Further evidence for variability at the pspA locus 
comes from the differences in the sizes of the amplified pspA 
gene. When PGR primers LSM12 and LSM2 were used to amplify the 
entire coding region of PspA, PGR products from different 
pneumococcal isolates ranged in size from 1.9 to 2.3 kbp. The 
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regions of pspA which encode the Q-helical, proline-rich, and 
repeat domains were amplified from corresponding strains and 
variation in pspA appears to come from sequences within the a- 
helical coding region. 

Table 65. Amplification of pspA by PCR using the indicated 

3' combination :ith the 



5' - primer Domain „ Amplified/ Percent 
tested Amplified 



LSM112 -35 (upstream) 2/14 

LSM12 leader 
LSM3 a-helical 
proline 



14 

8/14 57 
3/14 21 
12/14 86 



LSMlll proline 14/14 



100 



These studies have provided a finer resolution map of 
the location of conserved and variable sequences within pspA. 
Additionally, regions of divergence and identity between pspA and 
the pspc sequences have been identified. This data confirmed 
serological studies, and demonstrated that pspA and pspC 
sequences are highly variable at the DNA sequence level. The 
diversity of HindllZ restriction fragment polymorphisms contained 
pspA and the pspC sequence supported earlier data using larger 
probes that detected extensive variability of the DNA in and 
around these sequences. 
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A useful pspA-specif ic DNA probe would identify Rxl and 
WU2 pspA genes, in which restriction maps are known, and would 
identify only a single restriction fragment in most strains. Two 
probes, LSM3 and LSM4, do not hybridize with more than one 
Hindlll restriction fragment in any strain of pneumococcus . Both 
of these oligonucleotides hybridize with Rxl pspA and LSM3 
hybridizes with WU2 pspA. However, each of these probes 
hybridize with only four of the ather 15 strains. When these 
probes identify a fragment, however, it is generally also 
detected by all other Rxl-derived probes. Oligonucleotides from 
the second proline-rich region (LSMlll) and the C-terminus of 
pspA (LSM2) generally identify only one pspA-homologous sequence 
at high stringency. Collectively, LSMlll, 2, 3 and 4 react with 
16 of the 17 isolates and in each case revealed a consensus DNA 
fragment recognized by most or all of the oligonucleotide probes. 

When an oligonucleotide probe detected only a single 
DNA fragment it was presumed to be pspA. If the probe detected 
multiple fragments, it was presumed to hybridize with pspA. If 
the probe detected multiple fragments, it was presumed to 
hybridize with pspA and the pspC sequence. Based on these 
assumptions the most variable portion between pspA and pspC is 
the region immediately upstream from the -35 promoter region and 
that portion encoding the a-helical region. The most conserved 
portion between pspA and pspC was found to be the repeat region, 
the leader and the proline-rich region sequences. Although only 
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one probe from within the repeat region was used, the high degree 
of conservation among the 10 repeats in the Rxl sequence makes it 
likely that other probes within the repeat sequences would give 
similar results. 

The portion of Rxl psph most similar to the pspc 
sequence was that encoding the leader peptide, the upstream 
portion of the proline-rich region, and the repeat region. The 
repeat region of PspA has been shown to be involved in the 
attachment of this protein to the pneumococcal cell surface. The 
conservation of the repeat region within pspC sequences suggests 
that if these loci encode a protein, it may have a similar 
functional attachment domain. The conservation of the leader 
sequence between pspA and the pspC sequence was also not 
surprising since similar conservation has been reported for the 
leader sequence of other proteins from gram positive organisms, 
such as M protein of group A streptococci (Haanes-Fritz, E. et 
al., Nucl. Acids Res. 1988; 16: 4667-4677). 

In two strain, some oligonucleotide probes identified 
more than two pspA-homologous sequences, in these strains, there 
was a predominant sequence recognized by almost all of the 
probes, and two or three additional sequences share homology with 
DNA encoding the leader, a-helical, and proline region, and they 
have no homology with sequences encoding the repeat region in the 
C-terminus of PspA. These sequences might serve as cassettes 
which can recombine with pspA and/ or the pspC sequences to 
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generate antigenic diversity. Alternatively, the sequences might 
encode proteins with very different C-terminal regions and might 
not be surface attached by the mechanism of PspA. 

Oligonucleotides which hybridize with a single 
chromosomal DNA fragment were used as primers in PGR to examine 
the variability of domains within pspA. These results 
demonstrate that full-length pspA varies in size among strains of 
pneumococci, and that this'^variability is almost exclusively the 
result of sequences in the alpha-helix coding region. 
Example 12 - Cloning of PspC 

Chromosomal DNA from S. pneumoniae EF6796, serotype 6A 
clinical isolate, was isolated by methods including purification 
through a cesium chloride gradient, as described in Example 8. 
The Kindll-^rcoRI fragment of EF6796 was cloned in modified pZero 
vector (Invitrogen, San Diego, CA) in which the Zeocin-resistance 
cassette was replaced by a kanamycin cassette (shown in Figure 
18) . Recombinant plasmids were electroporated into Escherichia 
coli TOPIOF' cells (F' {iacI^Tet^} mcrA A (Tnrr-hsdRMS-mcrBC) 
<^»801acZAM15 4IacX74 deoR recAl ajraDl29 A(ara-leu)19Sl galU galK 
rpsL endAl nupG] (Invitrogen) . 

The 5' region of pspA.Rxl does not hybridize to pspC 
sequence at high stringencies by Southern analysis. Utilizing 
both the full-length Rxl pspA probe, and a probe containing the 
sequence encoding a-helical region of PspA, it was possible to 
identify which DNA fragment contained pspA and which fragment 
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contained the pspC locus. The pspC locus and the pspA gene of 
EF6796 were mapped using restriction enzymes. After digestion of 
chromosomal DNA with Hindlll, the pspC locus was localized to a 
fragment of approximately 6.8 kb. Following a double digest with 
Hindlll and EcoKL, the pspC locus was located in a 3.5 kb 
fragment. To obtain the intact pspC gene of EF6796, chromosomal 
DNA was digested with Hindlll, separated by agarose gel 
electrophoresis, the region between is and 7.5 kb purified, and 
subsequently digested with EcoRl. This digested DNA was analyzed 
by electrophoresis, and DNA fragments of 3.0 to 4.0 kb were 
purified (GeneClean, BiolOl, inc.. Vista, CA) . The size- 
fractionated DNA was then ligated in //indlll-rcoRI -digested 
pZero, and electroplated into E. coli TOPIOF' cells. Kanamycin- 
resistant transf ormants were screened by colony blots and probed 
with full-length pspA. A transf ormant, LXS200, contained a 
vector with a 3.5 kb insert which hybridized to pspA. 

Sequencing of pspC in pLXS200 was completed using 
automated DNA sequencing on an ABI 377 (Applied Biosystems, inc., 
PLACE) . Sequence analyses were performed using the University Of 
Wisconsin Genetics Computer Group (GCG) programs supported by the 
Center for AIDS Research (P30 AI27767) , MacVector 5.0, Sequencer 
2.1, and DNA Strider programs. Sequence similarities of pspC 
were determined using the NCBI BLAST server. The coiled-coil 
structure predicted by pspC sequence was analyzed using Matcher. 
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A gene probe for cloning the pspC locus. Two oligonucleotide 
primers, N192 and C558 (shown in Figure 19) , have been used 
previously to clone fragments homologous to the region of Rxl 
psph encoding amino acids 192-588 from various pneumococcal 
strains. These primers are modifications (altered restriction 
sites) of LSM4 and LSM2 which were previously shown to amplify 
DNA encoding the C-terminal 396 amino acids of PspA.Rxl (Figure 
17); this includes approximately 100 ainlno acids of the a-helical 
region, the proline rich region, and the C-terminal choline- 
binding repeat region. Using primers N192 and C558, a 1.2 kb 
fragment from strain EF6796 was amplified by PCR, and 
subsequently cloned in pET-9A (designated PRCT135) . This insert 
was then partially sequenced. 

Independently, a larger pspA fragment from strain 
EF6796 was made using primers LSM13 and SKH2 (shown in Figure 19) 
for the purpose of direct sequencing of serologically diverse 
pspA genes. 

The LSM13 and SKH2 primer pair result in the 
amplification of the 5' end of most pspA gene(s) encoding the 
upstream promoter, the leader peptide, the a-helical, and the 
proline-rich regions (amino acid -15 to 450) (Figure 20) . From 
the strain EF6796, the LSM13 and SKH2 primers amplified a 1.3 kb 
fragment (pspA.EF6796) , which was sequenced. The sequence from 
pRCT135 and the LSM13/SKH2 PCR-generated fragment pspA,EF6796 was 
not identical. The fragment obtained by PCR using primers LSM13 
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and SKH2 was designated pspA based on its location within the 
same chromosomal location as pspA.Rxl. The cloned fragment in 
PRCT135 was assumed to represent the sequence of the second gene 
locus, pspC, known to be present from Southern analysis. Both 
genes have significant similarity to the corresponding regions of 
the prototype pspA gene from strain Rxl. The second gene locus 
was called pspC, in recognition of its distinct chromosomal 
location, not sequence differences from the prototype pspA gene. 

Analysis of the nucleotide and amino acid sequence of pspC 
EF6796. To test the hypothesis that pRCT135 represented pspC of 
EF6796, and to further investigate pspC, the entire EF6796 pspC 
gene was cloned as a 3.4 kb /^indl II -£coi?l fragment forming 
PLXS200. DNA sequence of the pspC-containing clone pLXS200 
revealed an open reading fram of 2782 nucleotides based on the 
analysis of putative transcriptional and translation start and 
stop sites (Figure 21) • The predicted open reading frame encodes 
a 105 kDa protein which has an estimated pi of 6.09. 

PspA. Rxl and PspC.EF6796 are similar in that they both 
contain an a-helical region followed by a proline-rich domain and 
repeat region (Figure 20) . However, there are several features 
of the amino acid sequence of PspC which are quite distinct from 
PspA. From comparisons at the nucleotide as well as the 
predicted amino acid sequence, it is apparent that the region of 
strong homology between PspC and PspA begins at amino acid 458 of 
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PspC (amino acid 147 of PspA) and extends to the C-terminus of 
both proteins (positions 899 and 588 respectively) . The 
predicted amino acid sequence of PspC.EF6796 and PspA.Rxl are 76% 
similar and 68% identical based on GCG Bestfit program for this 
region (Figure 22) . The nucleotide sequence identity between 
pspC and psph is 87% for the same region* Eight bases upstream 
of the ATG start site is putative ribcsoroal binding site, 
TAGAAGGA. The proposed transcriptional start -35 (TATACA) and - 
10 (TATAGT) regions are located between 258 to 263 and 280 to 
285, respectively (Figure 21) . A potential transcriptional 
terminator occurs at a stem loop between nucleotides 3237 through 
3287, The putative signal sequence of PspC is typical of other 
gram positive bacteria. This region consists of a charged region 
followed by a hydrophobic core of amino acids. A potential 
cleavage site of the signal peptide occurs at amino acid 37 
following the Val-His-Ala. The first amino acid of the mature 
protein is a Glu residue. 

Other than features similar to all signal sequences, 
there is no homology in this region between pspA and pspC. This 
confirms that pspC is present in a separate chromosomal locus 
from that of pspA. The signal sequence and upstream region have 
striking similarity to the similar regions of S. agalactxae B 
antigen (accession number X59771) . The Bantigen of Group B 
streptococci is a cell surface receptor that binds IgA. 
Similarity to the bac gene ends with the start of the mature 
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protein of PspC, and the nucleotides are 75% identical in this 
region. Thus, although pspC is in a very similar chromosomal 
locus to the 6 antigen, it is clearly a distinct protein. 

The N-terminus of PspC is quite different from the N- 
terminus of PspA. Prediction of the secondary structure 
utilizing Chou-Fausman analysis (Chao, P.y. et al.. Adv. Enzymol. 
Relat Areas Mol. Biol. 1978: 47: 45-148), suggests that the 
structure of amino acids 16 to 589 of PspC is predominately o- 
helical. The Matcher program was used to examine periodicity in 
the tt-helical region of PspA. The characteristic seven residue 
periodicity is maintained by having hydrophobic residues at the 
first and fourth positions (a and d) and hydrophobic residues at 
the remaining positions. The coiled-coil region of the a-helix 
of PspC (between amino acid 32 to 600) has three breaks in the 
heptad repeat (Figure 2 3) . These disturbances in the 7 residue 
periodicity occur at amino acids 99 to 104, 224 to 267 and 346 to 
350. The a-helical region of PspA has seven breaks in the motif, 
each break ranging from a few amino acids to 23 amino acids each. 
In contrast, the three breaks in the coiled-coil motif of PspC 
involve 5, 43 and 4 amino acids, respectively. 

The sequence encoding the a-helical region of PspC 
contains two direct repeats 483 nucleotides (160 amino acids) 
long which are 88% percent identical at the nucleotide level. 
These repeats, which occur between nucleotides 562 to 1045 and 
nucleotides 1312 to 1795, are conserved both at the nucleotide 
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and amino acid level (amino acids 188 to 348 and 438 to 598) 
(Figure 24) . PspA lacks evidence for any repeats this prominent 
within the a-helical region* These repeat regions could provide 
a mechanism for recombination that could alter the N-terminal 
half of the PspC molecule. Although repeat motifs are common in 
bacterial surface proteins, a direct repeat this large or 
separated by a large spacer region is novel. The evolutionary 
significance of this region is not known. A Blast search of the 
repeat region and the 267 nucleotide bases between them revealed 
no sequence with significant homology at the nucleotide or amino 
acid level. However, one of the structural breaks in the coil- 
coiled region of PspC is the region between the two repeats. 
Perhaps some deviation from coiled-coil structure between the two 
repeats is critical to maintain the a-helical structure. 

Previous studies have shown that a major cross- 
protective region of PspA comprises the C-terminal 1/3 of the ot- 
helical region (between residues 192 and 260 of PspA.Rxl) . This 
region accounts for the binding of 4 of 5 cross-protective 
immunity in mice. Homology between PspC and PspA begins at amino 
acid 148 of PspA, thus including the region from 192 - 299- The 
homology between PspA and the PspC includes the entire PspC 
sequence C-terminal of amino acid 486. Based on the fact that 
PspA and PspC are so similar in this region known to be 
protection-eliciting, PspC is also likely to be a protection- 
eliciting molecule. Because of close sequence and conformational 
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similarity of the proteins in this region, antibodies specific 
for the region of PspA between amino acid 148 and 299 should 
cross-react with PspC and thus afford protection by reacting with 
PspC and PspA. Likewise, immunization with the PspC would be 
expected to elicit antibodies cross-protective against PspA. The 
differences between PspC of strain EF6796 and PspA of strain Rxl 
is no greater than the differences between many additional PspAs, 
which have been shown to be highly cross-protective. 

A proline-rich domain exists between amino acid 590 to 
652. The sequence, PAPAPEK, is repeated six times in this 
region. This region is very similar to the proline-rich region 
of PspA. Rxl which contains the sequence PAPAP repeated eight 
times in two proline-rich regions. These two regions of PspA. Rxl 
are separated by 27 charge amino acids; no such spacer region is 
present in PspC. 

Many cell surface proteins of other gram positive 
bacteria contain proline-rich regions. These are often 
associated with a domain of protein that is predicted to be near 
the cell wall murein layer when the protein is cell-associated. 
For example, in M proteins of S. pyogenes this domain contains 
both a Pro- and Gly-rich regions. The f ibronectin-binding 
protein of S. pyogenes, S. dysgalactiae , and Staphylococcus 
aureus contains a proline-rich region with a three-residue 
periodicity (pro-charged-uncharged) that is not found in PspA or 
PspC. An M-like protein of s. egui contains a proline-rich 
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region that is comprised of the tetrapeptide PEPK, This region 
lacks glycine normally found in the proline regions of M- 
proteins. The last proline repeat region of this molecule is 
PAPAK, which is more similar to the proline-region of PspA and 
PspC than it is to M-proteins. 

Proline-rich regions of gram positive bacterial 
proteins have been reported previously to transit the cell wall. 
The differences in proline-rich regions of proteins from diverse 
bacteria may reflect differences in protein function or possibly 
subtle differences in cell wall function. Proline-rich regions 
are thought to be responsible for aberrant migration of these 
proteins through SDS-polyacrylamide gels. 

The repeat region of PspC is a common motif found among 
several proteins in gram positive organisms. Autolysin of 5. 
pneuiDOJiiae , toxins A and B of Clostridium difficile, 
glucosyltransf erases from S. downei and S, mutans, and CspA of C. 
acetoJbiitylicura all contain similar regions. In PspA these 
repeats are responsible for binding to the phosphatidylcholine of 
teichoic acid and lipoteichoic acid in cell wall of pneumococci. 
However, bacterial proteins containing C-terminal repeats are 
secreted, which may imply either a lost or gained function. 
Although all of these proteins have similar repeat regions the 
similarity of the repeat regions of PspA and PspC is much greater 
than that of PspC to the other proteins (Table 66) . 
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Interestingly, PspC like PspA has a 17 amino acid 
partially hydrophobic tail. The function of this 17 amino acid 
region is unknown. In the case of PspA it has been shown that 
mutants lacking the tail bind the surface of pneumococci as well 
as PspAs in which the tail is expressed. Presently, it is now 
known whether PspC is attached to the cell surface or secreted. 

PspA and PspC proteins both have a-helical coiled-coil 
regions, proline-rich central regions, repeat regions, with a 
choline binding motifs, and the C-terminal 17 amino acid tail. 
PspA and PspC share three regions of high sequence identity. One 
of these is a protection-eliciting region present within the 
a-helical domain. The other two regions are the proline-rich 
domain and a repeat domain shared with other choline binding 
proteins and thought to play a role in cell surface association. 
The similarity throughout most of the structure of the PspA and 
the PspC molecules raises the possibility that the two molecules 
may play at least slightly redundant functions. However, the 
fact that the N-terminal half of the protein is not homologous to 
any of the ot-helical sequence of PspA suggests the PspC and PspA 
may have evolved for at least somewhat different roles on the 
cell surface. One of the most striking differences between the 
two molecules is the single repeat in the a-helical region of 
PspC. Although neither the exact function of PspA nor of PspC 
are known, the observation that a major cross-protective region 
of PspA is highly homologous with a similar region of PspC, 
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raises the possibility that both molecules are protection- 
eliciting and elicit cross-protective antibodies. 

The sequence similarity between the promoter region of 
the pspC gene and the Jbac gene from group B streptococci is very 
intriguing. It implies that an interspecies recombination event 
has occurred and, this interspecies recombination has contributed 
to the evolution of the pspC. The pspC gene thus has a chimeric 
structure, being partially like pspA and partially like the 6 
antigen. In the latter case, all protein similarity is limited 
to the signal sequence. Similar interspecies recombination 
events have contributed to the evolution of the genes encoding 
penicillin binding protein. 

Using analogous procedures, a second PspC sequence was 
isolated from strain D3 9 of S. pneumoniae. Figures 25 to 29 show 
the sequence data of PspC from strain D39, complete from upstream 
of the promoter through the proline-rich region. Strain D39 has 
the same genetic background as strains Rxl, from which pspA was 
sequenced. D39 and Rxl have the same pspC gene based on Southern 
blot analysis. 

The alpha-helical encoding region of the D39 pspC gene 
is one third of the size of the homologous region from the EF6796 
pspC gene. The proline-rich region of the D39 pspC gene was more 
similar to Rxl pspA than to EF6796 pspC. Even so, the two pspC 
genes were 86% identical at the nucleotide sequence, and 67% 
identical at the amino acid level. 
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In the alpha-helical sequence of EF6797 pspC a strong 
repeat was observed. This was absent in the pspC sequence of 
D39. The D3 9 pspC sequence also lacks a leader sequence, found 
in the EF6797 pspC sequence. 

This data strongly indicates that there is variability 
in the structure of pspC, similar to previous observations for 
pspA. In the case of pspC, however, the extent of variability 
appears to be even greater than that which has been observed for 
pspA. 
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Table 66. 



1 PERCENT HOMOLOGY OF 


CHOLINE BINDING 


REGIONS 1 


1 Percent i 

similarity/identity 


Protein Organism 


PspA 


PspC 1 


PepC S . pneumoniae 


86/60 


100/100 


Bacteriophage Gp- S. pneurjonl&e 


56/30 


56/28 


LytA S . pneumoniae 


57/33 


61/32 


PspA C, perfrxngens 


64/45 


59/42 


alpha toxin C. /lovyi 


54/29 


57/33 


CspB C. 

a ce t oJbu t y i i c um 


58/36 


61/45 


Having thus described 


in detail certain preferred 



embodiments of the present invention, it is to be understood that 
the invention defined by the appended claims is not to be limited 
by particular details set forth in the above description, as many 
apparent variations thereof are possible without departing from 



the spirit or scope thereof. 
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WHAT Ifl CIAIMBD J tt- 

1. An isolated amino acid molecule consisting of 
residues 1 to 115, 1 to 260, 192 to 588, 192 to 299, or residues 
192 to 260 of pneumococcal surface protein A of Streptococcus 
pneumoniae . 

2. An isolated DNA molecule consisting of a fragment 
of pneumococcal surface protein A gene of Streptococcus pneumonia 
encoding the isolated amino acid molecule of claim l. 

3. A PCR primer consisting essentially of the isolated 
DNA molecule of claim 2. 

4. A hybridization probe consisting essentially of the 
isolated DNA molecule of claim 2. 

5. An immunological composition comprising the amino 
acid molecule of claim i. 

6. An isolated DNA molecule consisting of nucleotides 
1 to 26, 1967 to 1990, 161 to 187, 1093 to 1117 or 1312 to 1331, 
or 1333 to 1355 of a pneumococcal surface protein A gene of 
Streptococcus pneumoniae . 

7. A PCR primer consisting essentially of the isolated 
DNA molecule of claim 6. 

B. A hybridization probe consisting essentially of the 
isolated DNA molecule of claim 6- 

9. An isolated DNA molecule consisting of a fragment 
of a pneumococcal surface protein A gene of Steptococcus 

178 



wo 97/09994 



PCT/US96/14819 



pneumoniae consisting of a nucleotide sequence (5' to 3') 

selected from 

CCGGATCCAGCTCCTGCACCAAAAAC ; 
GCGCGTCGACGGCTTAAACCCATTCACCATTGG ; 
CCGGATCCTGAGCCAGAGCAGTTGGCTG ; 
CCGGATCCGCTCAAAGAGATTGATGAGTCTG ; 
GCGGATCCCGTAGCCAGTCAGTCTAAAGCTG ; 
CTGAGTCGACTGGAGTTTCTGGAGCTGGAGC ; 
CCGGATCCAGCTCCAGCTCCAGAAACTCCAG ; 
GCGGATCCTTGACCAATATTTACGGAGGAGGC ; 
GTTTTTGGTGCAGGAGCTGG ; 
GCTATGGGCTACAGGTTG ; 
CCACCTGTAGCCATAGC ; 

CCGCATCCAGCGTGCCTATCTTAGGGGCTGGTT ; and 
GCAAGCTTATGATATAGAAATTTGTAAC • 

10. A PGR primer consisting essentially of at least 
one isolated DNA molecule of claim 9. 

11. A hybridization probe consisting essentially of at 
least one isolated DNA molecule of claim 9. 

12. PGR probe (s) which distinguishes between pspA and 
pspA-like nucleotide sequences. 

13. PGR probe (s) which hybridizes to both pspA and 
pspA-like nucleotide sequences. 

14. A PspA extract prepared by a process comprising 
growing pneumococci in a first medium containing 

choline chloride, 

eluting live pneumococci with a choline chloride 
containing salt solution, and 

growing the pneumococci in a second medium containing 
an alkanolamine and substantially no choline. 

15. A PspA extract prepared by 
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growing pneumococci in a first medium containing 
choline chloride, 

eluting live pneumococci with a choline chloride 
containing salt solution, 

growing the pneumococci in a second medium containing 
an alkanolamine and substantially no choline, and purifying PspA 
by isolation on a choline-Sepharose affinity column. 

16. An immunological composition comprising the 
extract of claim 14* 

17. An immunological composition comprising the 
extract of claim 15. 

18. An immunological composition comprising full 
length PspA. 

19. A method for enhancing immunogenic ity of a PspA- 
containing immunological composition comprising including in said 
composition the C-terminal portion of PspA. 

20. An immunological composition comprising at least 

two PspAs. 

21. The immunological composition of claim 20 wherein 
the PspAs are from different groups based on RFLP. 

22. PGR amplification product from a primer as claimed 
in claims 3,7, 10, 12 or 13. 

23. An isolated DNA molecule consisting of a 
nucleotide sequence homologous to a portion of pspA. 
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24. An isolated amino acid molecule comprising 
pneumococcal surface protein C, PspC, of Streptococcus pneumoniae 
having alpha-helical, proline rich and repeat regions. 

25. An isolated DNA molecule comprising a pneumococcal 
surface protein C gene of S, pneumoniae encoding the isolated 
amino acid molecule of claim 24 . 

26. A PCR primer consisting essentially of the 
isolated DNA molecule of claim 25. 

27. A hybridization probe consisting essentially of 
the isolated DNA molecule of claim 25. 

28. An immunological composition comprising the amino 
acid molecule of claim 24. 

29. An isolated amino acid molecule of claim 24 having 
strong homology with pneumococcal surface protein A, PspA, of S. 
pneumoniae from amino acid 458 of PspC, corresponding to amino 
acid 147 of PspA, extending to a C-terminus of PspC and PspA. 

30. An isolated amino acid molecule of claim 24, 
further comprising a signal sequence consisting essentially of a 
charged region followed by a hydrophobic core of amino acids. 

31. An isolated amino acid molecule of claim 24, 
wherein the alpha-helical region further comprises a seven 
residue periodicity and a coiled coil region having three breaks 
in a heptad repeat. 

32. An isolated amino acid molecule comprising 
pneumococcal surface protein PspC, of S. pneumoniae having 
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alpha-helical, proline rich and repeat regions, wherein the 
alpha-helical region comprises a C-terminus having substantial 
homology with a protection-eliciting region of PspA. 

33. An isolated DNA molecule comprising a pneumococcal 
svirface protein C gene of S. pneumoniae encoding the isolated 
amino acid molecule of claim 32. 

34* A PGR primer consisting essentially of the 
isolated DNA molecule of claim 33* 

35. A hybridization probe consisting essentially of 
the isolated DNA molecule of claim 33. 

36. An immunological composition comprising the amino 
acid molecule of claim 32. 

37. An isolated amino acid molecule of claim 24, 
further comprising a 17 amino acid, partially hydrophobic tail. 

38. An isolated amino acid molecule of claim 32, 
further comprising a 17 amino acid, partially hydrophobic tail. 

39. An isolated amino acid molecule of claim 24, 
further comprising an epitope of interest. 

40. An isolated amino acid molecule of claim 32, 
further comprising an epitope of interest. 

41. An immunological composition comprising the amino 
acid molecule of claim 39. 

42. An immunological composition comprising the amino 
acid molecule of claim 40. 
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AMINO ACID SEQDENCES IN THE NH2-TBRM3aiAL END OF DIFFERENT PSPA GENES. Gar 
inserted to maxiMze alignment with related PspA secjuences) . * 

ArCC6303 MNKKKBCEIiTS IASVAHiGTG rVASPPTXiVR AEESPQWEK SSMKKYEEA 
KAKADTAKKD YETAKK. . .K ASaQKKXDE DQKEC!!PH?KAK A.VKKVDEER 
QKAILAVQKA YVEY RE AKDKASAEKQ lAEAKRKT 



Ac94. . . MNKKKMILTS liASVAILGAG LVTAQPTLVR AEEAP.VASQ SKAEKDXTOA 

KRDAENAKKA LEEAKR AQXKirED D QKKTKKK AK E.EKQASEAE 

QKANLQYQLK LREYTQ. .KT GDR5KIQTEII EEAEKKHKTA KAETOKVRGT 
VIPSAARV 



Bgll703pro MJKKKHILTS LASWAILGAG LVTSQPTLVR AEEAP. VASQ SKAEKDXDAA 
VKKSEAAKKA YEEAKK. . .K AEDAQKKYDE DQKRTEEKA, ENBKKAAADL 
TEATEVHQKA YVRYSGSNEQ KIKNFKILAI 



Bg7322pro MXKKKMH/PS lASVAILGAG iXVASQPTOVR AEDAP.VANQ SQAEKDYXAA 
XXKSEAAKBDC YXXAKKVIAE AEAAQKX3CED XQKKPEEKA. EKAKAASEEI 
VKATEEVQXA A 



Bg7561pro MNKKKMIIiTS LASVAILGAG LVTSQPTLVR AEEAP.GASQ SKAEKDYXAA 
XKKSEAAKKA YEEAKK. . .K AEDAQKKTOE GQKKTEEKA. RKAEEASKEL 
AKATSEVQNA YVKYQGVQRN SRUJEKERKK QLAEIDEEIN KAKQIWNEKN 
EDFKKVREEV IPEPTELAKD QRKAEEAKAE EKVAKRKYDY ATLKVALRKS 
Y\ ^ft^ AXL 

Bg8090pro MNKKKMILTS LASVAILGAG LVTSQPTFVR AEEAP.VASQ PKAEKDXDPA 
GKKSEAATKA YEDAKP. . .T ASnAQKKYDE AQKKPDAER 



Bg8743pro MNKKKMILTS LASVAILGAG LVASQPTWR AEEAP.VAKQ SQAERira»A 
MKKSEAAKKE YEEAKKDLEE AKAAQKK^GG DPKKTGEETK LVPK.ADC2R 
PKANVAVPKA YLKLREAQEQ LNQSPNNKKN SAQOKLKDAL AHIDEVTLNQ 



Bg8838pro MNKKKMILTS LASVAILGAG LVTSQPTWR ABESP.VASQ SKAEKDYDAA 
VKNATAAKKA AEI2AHRALDE AKAAQKNYDE DQKKPEEKAK EVPKAPAEE. 
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Bg9163pro MNKKKMILTS lASVAILGAG LVASQPTLVR AEDAP.VANO SQAEKDinDAA 
MKKSEAftKKE YEDAKKVLAE AEAAQmED DQKKTEEKA. ENaUftASEEI 
AKAJTEEVH : 

Bg9739pro MNKKKMILTS LftSTAlLGAO LVASSPTWR AEEAP.VJiSQ SiaEKDroiA 

KRDA^MKA LBEMCR AOHaOD YQRRlEEKJkX K.ETQASLEO 

QBAMDSQEiK LmiiDGRilL SNSSVLKKBM BEAEKKDKEir OABF^^^ 

S^SSS^ ATBSOLVTRV BEAERNVTOJi ^\So^ 
VVDQAXZAEL S9GGHKLEPK 
Dbllpro MNKKraCXTS tASSCJULOAa LVASQPTWR AEHAP.VASQ SKAEKDmWl 

KRDAENAKKA LEEAKR AQKXXED DQKRIEBKAK X.nXQASEaE 

QKANUnrQLL LQKYVSESDG KKKKEXEXXA DAAIOaBIELK XADIJaSxQB 



^ISpro MNKKKMILTS LASVAILG&G LVASQPTWR AEEJ^P.VASQ SKAKKDTOAA 

^'E^^SKAAEED LE E AEAAQRIOTE DQKKSEENEK E.TEEASERO 

QAAmKYHLE SXEPLNYFQD NHR 



Dbl6aapro 



nbl6aDro 



MNKKKMILTS 

KRDAENAKKA 

QEANKDYQLK 

IWFNPQELE 

EVVLQAQIA. 

MNKKKMILTS 

KRDAEKAKKA 

QEANKDYQIiK 



lASVAILGAG 
LEEAKR. , . . 
LKKSOD^OIL 
MARRKSEVAK 

LASVAILGAG 
liEEAKR. ... 
LKKYLDORNL 



LVASPPTWR ABBAP.VASQ 
• . ^AQ ETOfAD YQRRIEEKAA 
SNSSVLKKEM BBAEKRDKEK 
TKESGLVKRV BBAEKKVTEA 



SKAEKDVDTA 
K.ETHASLEQ 
PAEFNKIRRE 
RPKLDAERAK 



LVASPPTWR ABBAP.VASQ SKAEKDYDTA 
...AQroraD YQRRIEEKAA K.ETHASLBQ 
SNSSVLKKEM EBAEKKDKEK QAGL 



Efl0197pro 



£f3296pro 



Ef6796pro 



1.819 OSpro 



Rxlpro 



MNKKKMII^S 
KRX>AE»ZAKKA 
QEANKDYQLK 
XWFNPQELE 
E WLPP TR*V 
MNKKKMILTS 
KAKADTAKKD 
ZDVALWQEIA 
EQQDLQNNFN 



LASVAILGAG 
LESAKR* . . . 
LKEOfLDGRNL 
MARRRSEWK 
ENEVHKLXQK 
LASVAILGAG 
YETAKK. • .K 
YXEr.« • .RE 
EVRAWAPDP 



LVTSQPTLVR AEESP.VASQ SKAEKDYDAA 
...AQEKJM YQRRIEEKAA JC.BQQAfitBQ 
SNSSVLKKEM EE A E KRDKKK QAEFNKIRRE 
AKESGLVKRV EEAEKKVTEA RQKLDAERAK 

LVTSQPTFVR AEESPQWEK SSLEKK5ffiEA 
AEDAQKranED DQKRTEEKAR K.EAEASQKL 
VQNQRSKYKS DADyQKKI/rE VDSKIEKARK 
TCVGXDXR 



^tt^KO^ LASVAHCAG XVTSQPTSVR AEEAPQWEK SSLEKKYEEA 
raKYDA^ YDBAKK. . .K AAEAQKKYEE dSeEKAE K.JSlA^ 
^^EEVQKA VLDYITAIRN HNDSGKTSAE EAENKAKERD YCCAGKKFDP 
IQTPFVASLT QMTT. ^ 

^^^f '^^E^ LVASSPIWR ABBAP.VASQ SKAEKDYDTA 
KRDAEilAKKA LBBAKR AQEaOAD YQHRIEEKAA K ETOAfflJEO 

S^f^ ^^^^^^ 



^^miTS LASVAILGAG FVASQPTWR AEESP.VASQ SKAEKDYDAA 
KKDAKNAKKA VBDAQKALDD AKAAQKKYDE DQKKIEEKA ^SaAS^ 
DKAVAAVQQA YLAYQQATDK JS^KMI 
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E£5568pro 



Bg6692pro 



Acl22pro 



A66pro 



L82013pro 



WPEFBQIAE TKKKSEEftKQ KAPEIiTKKLE EAKAKIjEEAE KKATEAKQKV 


MmooMiLTS LASVAiLGftG lva:^ptlvr AEESP.VMQ SKAEKDYDAA 
VKKSEZIAKJO^ YEEAKKALEE AICVAQKKYED DQXKTKUKA. ELEKEASEAI 
AKATEEVQQA YLMTQBASNK A. .EAAKMIE EAQRRENEAR AKPTTIRTM 
WPEPEQIAE TKKKASEAKA JCSFIOAKKAA EAKAKIiBBJ^ KKAIEAMFQV 
DA 

MtncKKMTr/rs lasvailgag fvassptfvr aeeap.vanq skaekdxdaa 

VKRSEAAEOCD YBIAKK, • .R A£DAQKXYDE 23QKKTEAKAB R.ERK2^SEKI 
AEAITKEVQOA YIiAYZiC2ASNB SORKEADdOCI lOSATKiaCMRR TCMLXIEFSQ 
QLYFLHQVS^ LRUIKKQKRQ QKKQIGrLREN LKHQLKRYKT BKIKmiKML 



IdNKKKLIVTS LASVAILGAD SVTSFPALVR ADEASLIASQ SKAEKDYDftA 
XKDAKNAKKA VEEAQKALDD AmQKKYDE DQKKTEKK&A AV.KKIDEEH 
QAAKIiKSQQA LVHFLAAQRE GITFKKKKAAQ ATT,KEAENAE KBIK 



MIKKKMIKTS liASAAlFOAX SETSQPTRVR PVEXPB.ARH PKVDKYYDAB 
ADE!f 



MNKKwia lASVAILG^ FGCVSAYSCK SRRISRS*SA *SSQRL 



MNKmrTTiKS LASA7VISGAX LVXPQPTLVR AE£SP*AASQ SHPEQDYI3XX 
XXLCXXLXHQ PSXGRfrLIiXX XXSXFXSPTP XXX3QCXP9CSX LTXDGEUDDC 
LKPFPLPXSX PXPPXPPXSP PSPPEREXLY XXPPXPXmi SLXLIPFLLL 
XLPPPXXXLP HLXSPPXPXL* PPSPTPX 
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SEQUENCES IN THE CEOTRAIi REGION - (Includes Caxboxy-tcnainxxs of eapha-heli 
ana region and scnne of the proline-rich region. Gaps are inserted to 
maxiiaize allgzment related FspA sequences.} 
30 33C 



0922134c 



Atcc6303c 



Acl22c 



I* KE IP ESDSED YLKBGLRAPL QSKLDTKKAK LSKLEELSDK 

IDEIOAEIAK L EVQLK DREG NNMVE A.YFKEC5LEK TTAEKKAELE 

Sf^S??^^ SfS?^* PAPEKPAK K 

PAPAPAP... EKPAPAPE K.PAEK PAEKPAEEPA EKPAPAPEK. 

PAPTPE .KPAPTPETP KTCWKQEMGM 

• • 'i^A; 4a;a;a;4;;, ; ^ ldxtiaegka qiaaxppnid 

RT FKDL EDSGLGLEKV LATLDPGGET PDQLDKEASE DSNIGALPNQ 

VSDLENQVSE LDREVTOLPS DLKDTEXSNNV 6EYVKGGLEK ALTDEKVBIU 

KTPKALDTAP KALDTALNEL G.PDQDEEET PAPAPiCPB QPA 

K PAPAPK PEKTEOQQAE EDXARfiSEEE 

TORLPQQQPP KREK- .PAPA PKPEQPVPAP, . • . _ 

GGW SWR*ILrARP 

DRIAARQAEIi AQKQTBIiaKIj LDSLDPEOKT QDEXDKEXGE AELDKK 

ADGLPNKVSD LEKEISNLEI LLGG&DSEDD T AALPN KLATKKAELE 

OTQKELDAAL NELG PDCSDBEET PAPAPQPE Q 

PAPAPiOPEQ. PTPAPK PEQPTPAPKP EQ. .PApI ! . 

AP KPEQ..PAPA PKPEQPAPAP KP.EQPTPGP KIB 



A66c 



Ac94c 



Bg8090c 



Bg8743c 



Bg9163c 



LLLLEKAGLG 

K&GADLKEAV NEPGESAGEP SQPEEPAEEA PAPEQPTEPT 

QPEEP AGETPAPKPE K. • .PAGQPK AEKTODQQAE mYARRSEEE 

VNRLTQQQPP KAEKPAPA. S PQPEQPAPAP K 



^ K EIDES DSED YVKEGLRVPL QSELDVKQAK LLKLEELSDK 

^^^AHCAK NLKKDVKDPQ NSGGCTS. . . .ALVLEAAEK DLVAKKAELE 

KTEADUOCAV NBPEKPAEE PENPAP VVVVaS 

PAPAPQP EKPA. . . PAPAPK PEKSADCX2AE EDYARRSEEE 

YNRLTQQQPP KAEKPAPAPV PKPEQPAPAP KSR .77777 

VXrj>RGPAEA AVKEQVDSPP QQIAD*VKEi STHGKFLGGA ATEDETSALP 

KKITAKQAEt. AKKQTOLEiai I-DNLDPECSKT QDKLDKEAAE AEUaiCK 

ADELPN KVAD LEKBISNLEI LLGQADPEDD T AALPN KLATKKAEPE 

KTPKEIIiAMi NELO,.,*. . . .PDGDEEEO* PA 

PAPAPKPEQ PAPA PAPKP EQPAPAp! .7 

AP KPEQPAPAPA PICPEQPTPAP K 7 



^ KEIDE SDSED YUGBGLRAPIi QSKTiDAKKAK LSKLDELSDK 

.^S^^^'^S?^ LEKDVGDPPN SDGBQ AGQYLV&AEK DLDAKEAELG 

NTGADLKKAV DEPETPAPA PAPK PAPAPAPT p 

EAPAPA. . • . PKPAPAPK PAPAPK PAPAPKPAPA PKPAPAPk! . 

. . • • PAPAPKPE RT 



.END 
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GVQRTRKRAP KRIMSLSQKV XUOCVCRAPL QSKZiDtAQKAB IjLKLEELSGK 

lEELUAEIAE LEVQLKDAEG NMNVE A.YFKEGLEK TTAEKKAELE 

XAXADtiKKAV DEFETPAPA. .w. .PAPA PAP a 

PAPAPA PAPAPAPK PAPAPK PAPAPAPAPA PKPAPAPk!. 

PAPAPAPA PKPEKPAEKP APAPKPEIXK TtQ 



Bg9739c 



L KEIDE SDSED YVREGFRAPL QSELDAXQAK LSKLEELSDK 

iraZiDAEIAK L EKDVE DFQH SDQBQ AGfifniAAAGE DLIAKKAELE 

KAERDLKKAV DEPETPAPA PA*. PAPAPAPT P 

EAPAPAPAPA PKPAPAPK PAPAPK PAPAPKPAPA PKPAPAPK., 

PAPAPAPA PKPEKPAEKP APAPKPB 



EflOlSc 



Ef3296c 



Ef6796xc 



L KE I DES D SED WKEGFRAPL QSELQAKQftK IiSKLEELSDR 

II3ELDAHXAK LEDQLKAAES MNKTVH* DVFKBGLEK TXAilKKAELE 

KTEADL KKAV NEETOPAEEP SQPEKPAEEA PAPEQPTEPT QPEKPAEQPQ 
papapqpekp.aeetpXpkpe "^R. . .PAEQPR AEKPADQQAE HDVAKRSECE 
7NRLTQQQPP KAEKPAPA.. PKTK : 



GGS 

17ELAAKQAEL 
ADELPNKVAD 
XTQKELDAAP 
PAPAPKPEQ. 

AP 



ALDQEAAAPP HQVADLEKQI 
AQKPTGLEKL LDSLDPGGKT 
LEKEISNLEI LLGGADSEDD 

NELG PDGDEEBT 

PAPAPK 

KPEQ..PAPA PKPEQPAKPE 



TGPEIFLG6A DPEADIAARP 

QDELDKEAGE AELDKK 

T. *«,AALFN KIiAXKXAEIiE 

PAPAPQPE Q 

PEQPAPAPKP EQ. .PAP. . . 
KPAEEFTQPE KPATPKT. . . 



CblSc 



L81905C 



RccllSc 



VRAL. .KVAE F6VQLRDAG6 SNNVG A.YFKEGLEE TTAEXKAGLG 

KAEADLKKAV DEFET PAP A 

PAPAPA PAPAPAPK PAPAPK PAPAPAPAPA PKPAPAPK*. 

. . - PAPAPAPA PKPEKPAEKP APAPKPETPK T. _ 



... * L KDIDESDSED YAKEGLRAPL QSBLDTKKAK I*LKLEELSGK 

lEELDAEIXE IiEVQIiKDAEG MNNVE A.YFKEGLEK TTAEKKAELE 

KAEADLKKAV DEPETPAPA PAPA PAPAPTPE A 

PAPAPA.. PKPAPAPK PAPAPK PAPAPKPAPA PKPAPAPKPA 

PAPAPAPAPK PAPAPAPAPA PKPEKPAEKP APAPKPETPK TGWKQENGM. 



« . L KEIDE SDgED YVKEGFElAPli QSELDAXQAK LSKLEEXSDK 

XDELDAEIAK LEKDVEDPKN SDGEQ AGQYLAAAEE DLIAKKAXLE 

KAEADLKKAV DEPETPAPA PA.. PAPAPAPT P 

EAPAPA PAPAPK PAPAPK PAPAPKPAPA PKPAPAPK.. 

PAPAPAPA PKPEKPAA 



LKEXDESDVE VKKAEEELVK EEAKEPRNEE KVKQAKAEVE 

SKKAEATRLE KIKTDRKKAE EAKRKAAEED KVKEK 

..PAPKPEN PAEQPK AEKPADQQAE EDYAHRSEEE 

YXRLTQQQPP ICTEKPAQPST PKT «vi«i-iw:. 
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Ret 121c 



Rctl23c 



Rctl29c 



Rctl35c 



K GKflRESRXEE KVNQPKXEVE 

SlWU^aTRIiB KIETCRICKAE EAXBKAAEED KVKEKFASQF QFAFAFQFEK 

PAPAPKPEN PAEQPK AEKPADQQAE EDYJfflRSEBE 

TORMXXXJPP KTEKPAQPST XK 1 



1 KEXEESXSED lOKBGLRAPL QSKLDTKKMC LSKLEELSDK 

JDELDAEIMC LEVQLKDABG HNNVB A.YPKECSLEK TTAEKKAELE 

KAEADLKKAV DEPETPAPA PQPA PAPEKPAE K 

PAPAPAP EKPAPAPEK. 

PAPTPE. .KPAPTPETP KTGWEQQENGM WYFYNTTX3SM 

ATGWLQNNGS WYYLNSBGAM ATGWHQ^S HYYINS 

L KEIDESDSED YLKEGLRAPL QSmXIKKAK liKLEELSDK 

I DKliDAEIA K liBVQLRlJAEG NNNVE A.YFKBGliEK TOAEKKT^ELB 

KAEADLKKAV DEPDTPAPA PQPA PAPEKPAE *:K 

PAPAPAP... EKPAPAPB K>PAPA PEKPAP. .AP EKPAPAPEK. 

PAPAPE .KPAPAPEKP APAPKPETPE TOLETRKRY. 



L KEIDESDSED YLKEGLRAPL QSKLD7TKKAK LSKLEELSDK 

IDELDAEIAK LEVQLKDAEG NNNVE A.YFKEGLEK TTAEKKaELE 

KAEADLKKAV DEPETPAPA PQPA PAPEKPAE K 

..PAPAP.«, EKPAPAPB K.PAPA P EKPAPAPEK. 

PAPAPE .KPAPTPETP KTGWKQENGM 



16/43 



wo 97/09994 



PCT/US96/14819 



BXlC 



Bg6652c 



Bg8838c 



Dbl6ac 



Dbllc 



L820131C 



L KEIDESESED YAKEGFRAPL QSICLDAKKAK LSKLEELfiDK! 

JDELDAEXAK I.BDQIiKAA£E mrFKBGLSK TXA^UCICAEIf 

KTEADLKiaW NEPEiCPA PAPET PAPEAPAE QPK 

PAPAPQP AFAPXCFS K. * .F^^PK PEKTDDQQAE ECYARR5ESB 

'XNRI/TQQQPP KAEKPAPA. . PKTGWKQENG MWYFYMTDGS M • . . . 



* - w GBQA. •QQYRAAABG DLAAKQAELE 

KTERDLKKAV MEPEK. .PA> PAPET PAPEAPAE QPK 

PAPAPQP APAPKPE K. . .PABQPK AEKTDDQQAE EDYARRSEEE 

TMRLTQQQPP KAEKPAPA. ♦ PiCPEQPAPA 



« * . « . «PK NSKC^A. . . .EQYRSAAGG DLAAKQVEI£ 

KTEADLKKAV N£P£K. .PA PAPET PAPEAPAE QPK 

PAPAPQP APAPKPE K. . .PAEQPK AEKPADQQAE EDYDRRSEEE 

YNRLTQQQPP KAEKPAPA. . PQPBQPAPAP KS 



L K EIDES DSED YVKBGFRAPL QSELDAKQAK IiSKLEELSDK 

IDELIltAEIAK .IiEKETVEOFK XSDGBQA* « * .GQYLAAAEE DIiIAKKAELB 

QTEADLKKAV NEPGKPAPA PAPET PAPEAPAE QPK 

PAPET. P.. . - ..APAPKPE K*, .PAEQPK PEKPADQQAE EDYARRSBEE 
YNRLTQQQPA PAQKPEQP.. AKPBKPAEEP TQPEK 



. . . .DAEIAK .LEKNVEYF^ KIDAEQT. . • .EQYLAAAEK DLADKKAELE 

KTEADLKKAV NEPEKPAEE TPAPA PKPEQPAE QPK 

PAPAPQP... ...APAPKP. EKTDDQQAE EDYARRSEEE 

YNRLPQQQPP KAEKPAPA.. PKPEQPVP 



• • • • A EXPENPAP APK 

PAXAPQPLKP EEPAEQPKPE KPEEPAGQPE PEKPDDQQAG EDYARKSGGE 
VNRFPQQQPP KAEKPAPA.. PKPBQPVPAP KT 

Bgll703c I.LKKA KLAGAKSKAA 

TKKAELEPEL EKAEA ELENL LSTLDPEGKT QDELDKEAAE . . . .AELNKK 
VEALPNQVSE LEBELSKLBD NIiKnAEINNV mv ri tKrrr.Typ AIATKQABLE 

KT P KELDAALNBL 5.PDGDEEBT PPPEAPAE .QPK 

PEK.PABBT. PAPAPK PBKSADQQAE EDVARRSBEB 

yNRLTQQQPP KAEKPAPAPA PKPEQPAPAP KSR 

^^^^to \\\ ilGIiA'raKKL NLAEAIllELL 

LKKLGLEPGli EKAOAGUaflli LSTLDPEOKT QDBliDKBAAE . . • .AEIMKK 
VEALPNQVAE LEEELSKLED NLKDAETNHV EOTIKEGLEE AIATKQABLE 

KT. . . . . . .P KELimUiEL G.PDQDEEET PAPEAPAE QPK 

PEK.PAEET PAPAPK PEKSADQQAE EDYARRSEEE 
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Bg7561c 



£f5666c 



Wu2c 



IfNRLTQQOPP ZCAEKPAPAPA PKPZQPAPAP K. 



• • • KKQKV miENL LSILDPGGKT QDELE2KGAAE AELMKK 

VEALPNPVXB IJSEELSPEED NLKDAEINH^ EDYIKEGLEE AIATKQA^ 

ET P QEVDAALNDL V.PDGGEEBT PAPAPQED. • EPA 

PAPAPNABQ PAPAEK PEKSADQQAE EDYARRSEGE 

iNRI/rQQQPP KAEKPAPAPA PKPEQPAPAP N. .TTTTT ^^^^^ 



2CEIAR ZiQSnLKDAES KNVEZ3Y1KBG 

ICT. • • .QKDL EDftEL ELEKy LAITLDPBSKT 
VEAUJNQraE IiEEELSKLED KLKDAETOW 

KT Q KELDAALNEL 6.PDG0EEET 

EEP. • ..EN PAPAPK 

TORLTQQQPP KAEK^.PAPA PQPEQPAPAP 



I'EOAITOKKA ELATTQQNID 
QD ELDK EARE •...AELNEK 
EDYIKEGLEE AIATKKAELE 

PAFAPQPE. KPA 

PEKSADQQAE EDYARRSEBB 
KIE 



^ KSIDESESBD VAKEGFRAPL HSKLDAKKAK I-SKLKELSDK 

IDELDABIAK LEDQLKAVEE KNNVE ETYSTEGLEK TIAAKKTELE 

KTEADLKKAV NEPEKSABEP SQPEKPAEEA PAPBQPTEPT - 

Q^^EKP AEETPAPKPE . .PAEQPN AEKTDDQQAE EDYARRSEEE 

TORLTQQQPP KAEKPAPA. . PQPEQTSSLH 
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Coas»I«to «*aueiice for ZW5SSB 

Sdquenc* Sangs: 1 to 14S3 ' 




10 20 30 40 SO 60 70 

arrGAOkAASA TTTAOGGAOG ACGCIT\TGC TIAASAXAAG TATAOGCTAA AAATGmAX CAGAMUGAfl 

SO 90 100 110 120 130 

OTMOITTZAG jmSMTJOGAAXAAAJaCmnXACAAOCCIAGCClySCGTCGCT 

HNKKKMXLT5L&SVAlIiG> 

140 ISO 140 170 leO 150 

» « • * • » 

OCX OCST TTT <JTT CCO TCT ICO OCT ACT TTT OTA AGX OCX QXX GXX OCT OCT GTA OCT AXC 

AGPVASSVTTVRASHAPVAN> 

200 210 220 220 240 2f0 

• * * • * « 

CASTCTAMGCTGAOAAAGACTKrOKrOCAGOlOTGAAX MA TCT G&A OCT OCT AAG AIA 
QSKAEKDYBAAVRXSSAAXIO 

200 270 200 290 300 310 

• . * * • ♦ * 

OAT TAC GXA ACG OCT AXA AXO AAA CCA CAA QAC OCT CAS AAO AAA. TAT CAT OAC CAT CAC 
D Y B T A X K K A E D A Q X K T D d Q> ^ 

320 330 340 3S0 3(0 370 

• * • • « « « 

AAO AAA ACT GAG OCA AAA GCG GAA AAA O&A AGX AAA OCT TCT QAA AAG ATA OCT GAG OCA 
XKT£AKA£X£RXASSXIACA> 

280 390 400 410 • 420 430 

• • * ♦ « • 

ACA AAA OAA GTT CAA CAA GCG TAC CTA OCT TAT CTA CAA OCT AGO AAC GAA ACT CAO ABA 
TKEVQQAYLAYJ.QASIIEffQR> 

440 150 440 470 480 490 

• • ♦ • * • 

AAA GAG OCA GAT AAG AAG ATA AAA GAA OCT ACG CAC GCA AAG ATG AGO OGG AGO TOC AAT 
XEADXXIXfiATHAXHRR. TCK> 

SCO 510 520 530 ^ 540 550 

• * • • ♦ » 

TTG ACT Arc GAA TTC GAA CAA CAA TT5 TAC TTC CTG AAC CAA GT6 ACT TAC CTC AOA CEA 
XiTxa7'aoQX«YrXiHfiV6VL3i£> 

$60 570 580 590 600 610 

• « • • « ^ 

AGA AAA AAO CAG AAG AGO CAA CAA AAG AAC CAG AAG TAT CTA AGA AAA AAT CTG AAG AGO 
RKKQXRQQXXQXYLRXMLXA> 

«20 630 640 650 660 670 

• ♦ ft ft 

CAG CTA AAC AGO OAT AAC TAT AGA AAA ATA AAA TACTTGAACAAGATOCTGAAAAOGAAA 
OX,KRVKy]iXIXYX. »XKX.XTK>^ 

480 690 700 710 720 730 

• • • » • • 

AGA AAA TTG ACO TAC TTC AAA ACA AAC TCC CTG ATT TAT AAA AAG GAA TTG CTC TOC ATC 
RKL»yyXT KSriYXXXl.LSl> 

740 750 760 770 780 790 

• » ♦ • ♦ * 

AAA AO^arcGCTaAATTAAATAAAGAAATTOCTAGACTTCAAAOCGATTTAAAACATOCT 
K ^ 

VAZLNXHrAXI,QSDt.KDA> 

•00 BIO 830 830 840 850 

• * » • • ft 

GAA GAA AAT AAT OTA GAA GAC TAC ATT AAA GAA GOT TCA GAG CAA GCT ATC ACT AAT AAA 
BEMKVEDYIXEC1.XQAIT»X> 
860 870 B80 890 900 910 
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AAA GCT GAA TTAGCTACAXC^CXACAAAACXTACUlTAAAACTCAAAAXGaTrrAC^ 

920 930 940 950 »£0 970 

• • « * • i 

GCT GAA TTA GAA CTT GAA AAA GTA TZA GCT ACA TZA GAC CCT GAA OC?r AAA ACT C3UI GAT 
ABI.ELBKVLATI.DP EGXTQD> 

980 990 1000 1010 1020 X030 



GAA TTA GAT AAA GAA GCT OCT GAA OCT GAS TTC AAT GAA AAA GTT GAA GCT CIT CAA AAC 
ELDKEAABAEINEKVBAL Q^ 

lOiO 1050 1060 1070 1080 1090 

* • • • * • 

CAA OTT OCT OAA TTA GAA GAA GAA CTT TCA AAA CTT GAA GAIT AMP CTT AAA GAT OCT OAA 

QVABLEEBLS1CI.E1)HLX0A^ 

- UOO 1110 lUO 1130 1140 1150 

* » • * , ♦ 

ACA AA£ AAC GTT GAA GAC TAC ATT AAA GAA CCT TTA GAA GAA GCT ATC OCG ACT AAA AAA 
TSNVEDYIXEGLEEAIAT K^ 

lUO 1170 1180 1190 1200 ' 1210 

* ^ • » ^ ^ 

CCT GAA TOO GAA AAA ACT CAA AAA CAA TTA CATCCACCTCrPAJKrCaCTTACCCOCTGAT 
AEI.EXTQKBLDAALHEI,C P ^ 

1220 1230 1340 1350 1250 1270 

* • • * • 

GGAGATGAAGAAGAGACTCCACCGCCCGCTCCTCAACC31C3AAAACCAGCTGAAGAGOCT 
C O E E E TPAPAPQPEKP A T B 

1250 1290 1300 1310 1320 1330 

* * » ♦ • 

GAC AAT CCA GCT CCA CCA OCA AAA CCA CAG AAC TCA OCA GAT CAA CAA CCT CAA GAA GAC 
EWPAPAPKPEKSAPQQAESl^ 

1340 1350 1350 1370 1350 1390 

TATGCTCOTAGATCAGAACAAGAATATARXCGCrrCAXCAAC^ 

yarr.seeeykrl'Itqqqppk^ 

1400 1410 1420 1430 1440 1450 

GAAAAACCAGCTCCTCCACCAOACCAGACCAAOaiGCTCC^ 
bAPAPAPQpEQPA?APKZEA> 
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1^ 



If 



MC27 MC28 



Top of Gel 
190 kDa 
82 kDa-=r 
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LSMpgpA13/2 



LSMp5pA12/6 




4.2 kb 
3.6 kb 
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Primer LSM13 : gcaagcttatgatatagaaatttgtaac 
Primer LSM2 : gcgcgtcgacggcttaaacccatt caccattgg 

Probe LSMpspA13/2 (from RXl sequence) : 
aagcttatga tatagaaatt tgtaacaaaa atgtaatata aaacacttga 
caaatattta cggaggaggc ttatacttaa tataagtata gtctgaaaat 
gactatcaga aaagaggtaa atttagatga ataagaaaaa aatgatttta 
acaagtctag ccagcgtcgc tatcttaggg gctggttttg ttgcgtctca 
gcctactgtt gtaagagcag aagaatctcc cgtagccagt cagtctaaag 
ctgagaaaga ctatgatgca gcgaagaaag atgctaagaa tgcgaaaaaa 
gcagtagaag atgctcaaaa ggctttagat gatgcaaaag ctgctcagaa 
aaaatatgac gaggatcaga agaaaactga ggagaaagcc gcgctagaaa 
aagcagcgtc tgaagagatg gataaggcag tggcagcagt tcaacaagcg 
tatctagcct atcaacaagc tacagacaaa gccgcaaaag acgcagcaga 
taagatgata gatgaagcta agaaacgcga agaagaggca aaaactaaat 
ttaatactgt tcgagcaatg gtagttcctg agccagagca gttggctgag 
actaagaaaa aatcagaaga agctaaacaa aaagcaccag aacrtactaa 
aaaactagaa gaagctaaag caaaattaga ag'aggctgr^g aaaaaagcta 
ctgaagccaa acaaaaagtg gatgctgaag aagtcgctcc tcaagctaaa 
atcgctgaat tggaaaatca agttcataga ctagaacaag agctcaaaga 
gattgatgag tctgaatcag aagattatgc taaagaaggt ttccgtgctc 
ctcttcaatc taaattggat gccaaaaaag ctaaactatc aaaacttgaa 
gagttaagtg ataagattga tgagttagac gctgaaattg caaaacttga 
agatcaactt aaagctgctg aagaaaacaa taatgtagaa gactacttta 
aagaaggttt agagaaaact attgctgcta aaaaagctga attagaaaaa 
actgaagctg accttaagaa agcagttaat gagccagaaa aaccagctcc 
agctccagaa actccagccc cagaagcacc agctgaacaa ccaaaaccag 
cgccggctcc tcaaccagct cccgcaccaa aaccagagaa gccagctgaa 
caaccaaaac cagaaaaaac agatgatcaa caagctgaag aagactatgc 
tcgtagatca gaagaagaat ataatcgctt gactcaacag caaccgccaa 
aagctgaaaa accagctcct gcaccaaaaa caggctggaa acaagaaaac 
ggtatgtggt acttctacaa tactgatggt tcaatggcga caggatggct 
ccaaaacaac ggttcatggt actacctcaa cagcaatggt gctatggcta 
caggttggct ccaatacaat ggttcatggt attacctcaa cgctaacggc 
gctatggcaa caggttgggc taaagtcaac ggttcatggt actacctcaa 
cgctaatggt gctatggcta caggttggct ccaatacaac ggttcatggt 
attacctcaa cgctaacggc gctatggcaa caggttgggc taaagtcaac 
ggttcatggt actacctcaa cgctaatggt gctatggcta caggttggct 
ccaatacaac ggttcatggt actacctcaa cgctaacggt gctatggcta 
caggttgggc taaagtcaac ggttcatggt actacctcaa cgctaatggt 
gctatggcaa caggttgggt gaaagatgga gatacctggt actatcttga 
agcatcaggt gctatgaaag caagccaatg gttcaaagta tcagataaat 
ggtactatgt caatggttta ggtgcccttg cagtcaacac aactgtagat 
ggctataaag tcaatgccaa tggtgaatgg gtttaagccg 

Tiaure 
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Primer LSM12; 
Primer LSM6 : 



ccggatccagcgtcgctatcttaggggctggtt 
ctgagtcgactggagtttctggagctggagc 



Probe LSMpspA12/6 (from RXl sequence) : 
ccagcgtcgc tatcttaggg gctggttttg ttgcgtctca gcctactgtt 
gtaagagcag aagaatctcc cgtagccagt cagtctaaag ctgagaaaga 
ctatgatgca gcgaagaaag atgctaagaa tgcgaaaaaa gcagtagaag 
atgctcaaaa ggctttagat gatgcaaaag ctgctcagaa aaaatatgac 
gaggatcaga agaaaactga ggagaaagcc gcgctagaaa aagcagcgtc 
tgaagagatg gataaggcag tggcagcagt tcaacaagcg tatctagcct 
atcaacaagc tacagacaaa gccgcaaaag acgcagcaga taagatgata 
gatgaagcta agaaacgcga agaagaggca aaaactaaat ttaatactgt 
tcgagcaatg gtagttcctg agccagagca gttggctgag actaagaaaa 
aatcagaaga agctaaacaa aaagcaccag aacttactaa aaaactagaa 
gaagctaaag caaaattaga agaggctgag aaaaaagcta ctgaagccaa 
acaaaaagtg gatgctgaag aagtcgctcc tcaagctaaa atcgctgaat 
tggaaaatca agttcataga ctagaacaag agctcaaaga gattgatgag 
tctgaatcag aagattatgc taaagaaggt ttccgtgctc ctcttcaatc 
taaattggat gccaaaaaag ctaaactatc aaaacttgaa gagttaagtg 
ataagattga tgagttagac gctgaaattg caaaacttga agatcaactt 
aaagctgctg aagaaaacaa taatgtagaa gactacttta aagaaggttt 
agagaaaact attgctgcta aaaaagctga attagaaaaa actgaagctg 
accttaagaa agcagttaat gagccagaaa aaccagctcc agctccagaa 
actccag 
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SKH2 5CATACCgTTTrCrrgnTCCAgCC-3' 

LSM13 5' gCA AgC TTA TgA TAT AgA AAT Tig TAA C -3' 

N192 5' ggAAggCCATATgCTCAAAgAgATTgATgAgTCT -3" 

C588 5' CCAAggATCCTTAAACCCATTCACCATrggC -3" 



28/43 



wo 97/09994 



PCTAJS96/14819 



leader 



a-helical 



N 



prolines repeats 

E 



449 




4 s 6 



tail 

EEH- 



590 652 



892 



leader 



a-helieal 



prolines repeats 



tail 



N 




147 



A - B 



288 370 



577 588 



LSM13 



N192 



CS56 



LSM2 



Figure^' 



r Companson of the structural motifs of PspA and PspC. 
PspA has a smaUer a-hellcal region, and does not contain the direct 
repeats within the a-helix (indicated by the dashed lines). The a- 
helical regions which are homologous between PspA and PspC are 
indicted by the striped pattern. PCR primers are indicated by the 



arrows. 
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249 SDSSVGEETLPSPSLNMANESQTEHRKDVDEYIKKMLSEIQUDRRKHTQN 298 

.:|.|:.:. | : :|.|:: ,j |.: , ..:|..:: 

. 1 EESPVASQSKAEKDYDAAKKDAKNAKKAVED.AQKALDDAKAAQK3CYDED 49 

299 VNI^IKLSAIKTKYLYELSVLKENSKXEELTSKTKAEI^TAAFEQFKK^ 348 
_ • . . -h I.. .||: .|. |.: .|: .:...| 

50 QKKTEHKAAL EKAASEEM.DKAVAAVQQAYIAYQQATD 86 

349 KPEiaCVAEAEKK\^EAKKKAJa5QKEEDRRNYPTN^ 398 
..|:|.::||||;. || :.: : 

87 KAAK. .DAADKMIDEAKKREEEAKTK FNTVRAMW 119 

• " • • . 

399 KEAELELVKEEANESRNEEKIKQAKEKVESKKAEATRLEKIKTDIUCKAEE 448 

-:.|-: :.|.| I- |..|. | .| | 

120 PEPEQLAETKKKSEEAKQKAPEL TKKLE 147 

449 EAKHKAEESEKKAAEAKQKVBAEEYALEAKIAEIXYcVQRLEKELKEID 498 

III I IMIIMIIIIIIill I :||lllll :|:||MIIII|| 
148 EAKAKLEEAEKKATEAKQKVDAEEVAPQAKIAELENQVHRLEQELKEIDE 197 

499 SDSEDYLKEGIJlAPLQSKLDTKKAKLSKLEELSDK-IDELDAEIiUCLE^^ 548 

_ hllll llhllllllllMlllllllllillllllllllillll M 

198 SESEDYAKEGFRAPLQSKLDAKKAKLSKLEELSDKIDELDAEIAKLEDQL 247 

" • * * • 

549 KDAEGNNNVEAYFKEGLEKTTAEKKAELEKAEADLK^^^ 598 
, 1"li = llilMlllll||l.MIII||M|||||||:||M||||: 
243 KAAEENNNViDY?r^GLEXTIAAKKAHLEKTEADLK:^^ 297 

599 PAPAPEKPAE . , KPAPAPEX . PAPAPEKPA , . PAPEKPAPA 634 

•!lll Ml llllih. Ill HIM I III.... 

298 .TPAFEAPAEQPKPAPAPQPAPAPKPEKPAEQPKPEKTDDQQAEEDYARK 346 

633 PEXPAPAPEKPAPTPETPKTGWKQENGJflVYTYNTDGSMATGW 676 

• = :|MII -lUilllllllllllllllll 

34/ SEEEYlNT?tT^QPPKAEKPAP. . .APKTGWKQENGMlvTrYim^ 388 

" • - • 

677 LQN^IGSVnTLNSNGAMATGWLQNNGSWYYLNSNGAMATGWLQYNGSWY^ 726 

lllilllllllllMIIIIIIMIIIIIIIIIill 

389 . MATGWLQNNGSWYYLNSNGAMATG;^LQYNGSWYYI* 423 

727 NANGDMATGWIQYNGSWYyLNA^XJDMAT<r^QyNGSV7ry 776 

MIMIIII . IIIIIIIIIIMMIhlllllllllMIMIIII 

424 NANGAMATGWAKVNGSWYYI^NGAMATGWLQYNGSWYYLNANGAMATGW 473 
777 FQYNGSWyyL^lANGDMATGWI^YNGSWYYLNSNGAKVTC;^QNNGSWyYL 826 

' iMltllllll.||||||||||||l!IMII|.||| . Illllll 

4/4 AXVNGSWyYIiNANGAMATGWLQm;SWrmiANGAMATG;^AKVNGSWyyL 523 

• • • " • 

827 NANGSMATDWVKr)GDTVrm,EASGAMKASQWFKVSDK:r^^ 876 

IIIMIhlllllllllllllllllllllllllMlllilll llllll 

524 NANGAMATGWVKIXJDTWYYLEASGAMKASQWFKVSDK^^rWGL^^ 573 

877 TTVDSYRWA^IGElAn/- 891 

lllhhilllllll 
574 TTVDGYKVNANGEVA/ 588 

Figure Bestfit analysis of PspA and PspC. Percent identitv 

is 69% and percent similarity is 77%. Amino acids of PsoA are on 
the bottom line (1-588) and amino acids of PspC are on the top 
line (249-891) . A dashed line indicates identity. 
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1 
11 
16 
22 
29 
34 
41 
44 
51 
56 
63 
64 
69 
76 
81 
85 
92 
99 
105 
108 
115 
122 
126 
133 
138 
145 
152 
159 
166 
173 
ISO 
137 
190 
197 
204 
210 
216 
223 
224 
234 
244 
254 
264 
268 
271 
277 
281 
288 
295 
300 
305 
312 
316 
323 
325 
332 
336 



Gin 
Tyr Ala 

Leu Lys 
Leu Lys 
Val Gin 

Leu Ser 



Asp lie Ser 
Asp Glu Val 
His Leu Glu 
Asp Val Lys 
Lys 

His Thr Gin 
Gly Leu lie 
Glu lie Lys 



Leu Tyr Asp Leu Lys 
Val Asn Val Leu Ser 
Glu Leu Thr 
Thr Lys 
Lys Glu Lys Leu Thr 
Phe Glu Gin Phe Lys 



Val Ala Glu 
Val Glu Glu 

Lys Glu Lys 
Tyr Pro Thr 
Tyr Lys Thr 
lie Ala Glu 
Val Lys Lys 
Leu Val Lys 
Glu Ser Gin 
lie Lys Gin 
Val Glu Ser 

Leu Lys Lys 

Arg Glu Glu 
Ala Asp 
Glu Ala 

Val Ala Thr 

Lys 



Glu 
Ala Gin 
Ala Lys 
Ala Glu 
Asp Arg 
He Thr 
Leu Glu 
Ser Asp 
Ala Glu 
Val Lys 
Asp Glu 
Ala Glu 
Lys Gin 
Ala Thr 
He Lys 
Ala Lys 
Ala Lys 
Val Glu 
Ser Glu 



Lys Lys 
Glu Ser 
Ser lie 
Lys A^n 

Asn Val* 
Thr Lys 
Lys Lys 
Tyr 

Glu-: Ala 
Ser Lys 
Glu Thr 
Ala Thr 
Lys Asp 

Lys Cys 
Lys Lys 
Lys Lys 
Asp Gin 
Arg Asn 

Leu Glu 
Val Glu 
Leu Glu 
Ala Lys 
Glu Lys 
Ala Glu 
Ala Glu 
Arg 

Thr Asp 
Arg Lys 
Leu Lys 
Lys Asn 
Gin Asp 



Glu Gly Val Arg Ser Gly Asn Asn Leu Thr 
Val Thr Ser Ser Gly 



*Thr Leu Pro Thr Glu Pro 



Thr 
Val Asp 
He Lys 
He Gin 



Leu Ser 
Leu Ser 
Lys Glu 
Leu Thr 



Glu His 
Glu Tyr 
Lys Met 
Leu Asp 
His Thr 
Asn Leu 
Ala He 
Tyr 
Val Leu 

Glu Leu 
Thr 
Ala Ala 



Glu Ser Gin 
Arg Lys Asp 

Leu Ser Glu 
Arg Arg Lys 
Gin Asn Val 
Asn He Lys 
Lys Thr Lys 
Leu Tyr Glu 
Glu Asn 
Ser Lys 
Thr Ser Lys 
Ala Glu 



Pro Lys Arg Arg Ala Lys Arg Gly Val Ser 
Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu 
Asn Asp Ala Lys Ser Ser Asp Ser Ser Val 
Gly Glu Glu Thr Leu Pro Ser Pro Ser Le»^ 
Asn Met Ala Asn 



Lys 



Lys 
Phe Glu Gin 



34/43 



wo 97/09994 



PCT/US96/14819 



343 Phe Lys Lys 

fcf , Asp Thr Leu Lys Pro 

•^^■^ Glu Lys Lys 

354 Val Ala Glu Ala Glu Lys Lys 

361 Val Glu Glu Ala Lys Lys Lys 

368 Ala Lys Asp Gin 

372 Lys Glu Glu Asp Arg Arg Asn 

379 Tyr . 

380 Pro Thr Asn Thr 
384 Tyr Lys Thr Leu Glu Leu Glu 
391 lie Ala Glu Ser Asp Val Lys 
398 Val Lys Glu Ala Glu 

403 Leu Glu Leu Val Lys Glu Glu 

410 Ala Asn Glu Ser Arg Asn^Glu 

417 Glu Lys lie Lys Gin Ala 

423 Lys Glu Lys Val Glu Ser Lys 

430 Lys Ala Glu Ala Thr Arg , 

436 Leu Glu Lys He Lys Thr Asp 

443 Arg Lys Lys Ala Glu. Glu Glu - : . 

450 Ala Lys Arg Eys 

454 Ala Glu Glu Ser Glu Lys Lys 

461 Ala Ala Glu Ala Lys Gin Lys 

468 Val Asp Ala Glu Glu Tyr Ala 

4*75 , Leu Glu Ala Lys 

479 He Ala Glu Leu Glu Tyr Glu 

486 Val Gin Arg Leu Glu Lvs Glu 

493 Leu Lys Glu 

496 lie Asp Glu Ser Asp Ser Glu 

503 Asp Tyr Leu Lys Glu Gly 

509 Leu Arg Ala 

512 Pro Leu Gin Ser Lys 

517 Leu Asp Thr Lys Lys Ala Lys 

524 • Leu Ser Lys 

527 Leu Glu Glu Leu Ser Asp Lys 
534 He Asp Glu Leu Asp Ala Glu 
541 lie Ala Lys Leu Glu Val Gin 
548 Leu Lys Asp Ala Glu Gly Asn 
555 Asn Asn 

557 Val Glu Ala Tyr Phe Lys Glu 
564 Gly Leu Glu Lys Thr 

569 Thr Ala Glu Lys Lys 

574 - Ala Glu Leu Glu Lys Ala 
580 Glu Ala Asp Leu Lys Lys Ala' 
587 Val Asp Glu 



Figure? coUed-coil motif of the a helix of PspC 

Amino acids that are not in the coUed-coil motif are m the right column. 
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Sequence of the alpha helical and proline region of LXS532 (PspC.D39) . 
Nucleotides 1-516 include the upstream region, noncoding region. The 
alpha helical region is underlined (bp 517-1112) and the proline regio 
is italicized (1113-1326). 

1 CCAAGCTATT AGGTGACACT ATAGAATACT CAAGCTATGC ATCAAGCTTA 

51 TGCTTGTCAA TAATCACAAA TATGTAGATC ATATCTTGTT TAGGACAGTA 

101 AAACATCCTA ATTACTTTTT AAATATTCTT CCTGAGTTGA TTGGCTTGAC 

151 CTTGTTGAGT CATGCTTATG TGACTTTTGT TTTAGTTTTT CCAGTTTATG 

201 CAGTTATTTT GTATCGACGA ATAGCTGAAG AGGAAAAGCT ATTACATGAA 

251 GTTATAATCC CAAATGGAAG CATAAAGAGA TAAATACAAA ATTCGATTTA 

301 TATACAGTTC ATATTGAAGT AATATAGTAA GGTTAAAGAA AAAATATAGA 

351 AGGAAATAAA CATGTTTGCA TCAAAAAGCG AAACSAAAAGT ACATTATTCA 

401 ATTCGTAAAT TTAGTATTGG AGTANCTAGT GTAGCTGTTG ccagtcttgt 

451 tatgggaagt gtggttcatg csaccagara aacgarggaa gtacccaagc 

501 AGCCMCTTCT TCTAATATGG PAAAnAnAqft ACATAflfiAAA QCYr^TAAAP 



fim 


MAGTCGTCGA TGAATATATA RAAAAftaiCT TCAr^AGAT TPARPTARaT 
AGAAGAAftflC ATACCCAAAA TGTCOCrTTA aapataaart TflAnrra-taT 




TANAACGAAG TATTTGCGTO AATTAAWTGT TWTARAABAr; AA(yrr(?Aft|ffl 


701 


ATGAGTTGCC GTCAGAAATA AAAGCRAagt TARArnrrGC TTTTGANAar 


751 


TTTAAAAAAG ATACATTGAA APrAr«AGAA AAnnTAGCWR AAnPTAftr^ftft 


SOI 


GAANGTTnAA GAAGCTAAGA AWAAAnrppA RnATPAAAaA cAAnAARATr 


B51 


GYCGTAACTA CCCAArrAAT APTTwratAA rRTTTYiArrT TRAAaT-mrn' 


901 


GAGTYCGATG TGAAAOTTAA AfiAtrirrysAc; mviaapTAc; TAAAPr;tr!«i^ 


951 


AGCTMMRGAA YCTCGAGAm arhaaaaaut TAAnPAArrcA AAARmAAAn 


1001 


TTGAGftGTftA AAAAGCTfiAG GCTACAAGGT tahaaaapat caagaparat 


1051 


NGTAAAAAAG CAGAAGAAGA AGMTAAAPq^ AAAP^AnPAfi A AHA Ar; ao-a ^ 



-Utti AGTTAAAGAA AAAPPAgrTirf ftfti^aa^r^jir^^ ACCAGCGCO? GNTACTCAAC 

1151 CAGAAAAACC AGCTCCAAAA CCAGAGAAGC CAGCTOAACA ACCAAAAGCA 
1201 GAAAAAACAG ATGATCAACA AGCTGAAGAA GACTATGCTC GTAGATCAGA 
1251 AGAAGAATAT AATCGCTTGA NTCAACAGCA ACCGCCAAAA ACTGAAAAAC 



Kg. 
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1301 CAGCACAACC ATNTACTCCA AAAACA 
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Fig. 2S Ccon^nufd) 
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Comparison of nucleotides of pspA.Rxl to pspC.D39. PspA is the top line 

(bp 875- 1322) and pspC is the bottom line (bp 877- 1326) . The region 

which is most homologous includes the nucleotides which encode the 

proline region (bp 1113 pspC, bp 1128 pspA) . 

Percent Similarity: 77.083 Percent Identity: 74.769 

875 AAAAAGCTAAACTATCAAAACTTGAAGAGTTAAGTGATAAGATTGATGAG 924 

INI III II I III III Mil: II II II I II 
877 AAAACGCTTGACCrrTGAAA. . TTGCTGAGTYCGATGTGAAAGTTAAAGAA 924 

925 TTAGACGCTGAAATTGCAAAACTTGAAGATCAACTTAAAGCTGCTGAAGA 974 

II INI I I III: MM h:: I II II 

925 GCGGAGCTTGAACTAGTAAARGAGGAAGCTMMRGAAYCTCGAGACGAGGA 974 

975 AAACAATAATGTAGAAQACTACTTTAAAQAAGGTTTAGAGAA^^ 1024 

Ml I III III J- .1111 Ml MM 
975 AAAAATTAAGCAAGCAAA .AGCGAAAGTTGAGAG 1007 

1025 CTGCTAAAAAAGCTGA ATTAGAAAAAACTGAAGCTGACCTT 1065 

IIIIIIMIIIi IIMIIII I I I II : I 

1008 ... . TAAAAAAGCTGAGGCTACAAGGTTAGAAAACATCAAGACAGATNGT 1053 
1066 AAGAAAGCAGTTAATGAGCCAGAAAAACCAGCTCCAGCTCCAGAAACTCC 1115 

II Mlllll I II II: MM I MM MM I 

1054 AAAAAAGCAG. . . AAGAAGAAGNTAAACGAAAAGCAGCAGAAGAAGATAA 1100 
1116 AGCCCCAGAAGCACCAGCTGAACAACCAAAACCAGCGCCGGCTCCTCAAC 1165 

11 MM Mllllllllllllll lllllllllllhl llllll 

1101 AGTTAAAGAAAAACCAGCTGAACAACCACAACCAGCGCCGGNTACTCAAC 1150 
1166 CAG. . .CTCCCGCACCAAAACCAGAGAAGCCAGCTGAACAACCAAAACCA 1212 

Ml II II IIIIIIIIIMIIIMIIIMIIIIIIIIIIM II 

1151 CAGAAAAACCAGCTCCAAAACCAGAGAAGCCAGCTGAACAACCAAAAGCA 1200 
1213 GAAAAAACAGATGATCAACAAGCTCAAGAAGACTATGCTCGTAGATCAGA 1262 

lllilllllllllllllllllMIIIIIIIMMIIIIIIIIMIIIMI 

1201 GAAAAAACAGATGATCAACAAGCTGAAGAAGACTATGCTCGTAGATCAGA 1250 
1263 AGAAGAATATAATCGCTOy^CTCAACAGCAACCGCCAAAAGCTGAAAAAC 1312 

IIIIIIIIIMIIIIIIII|:|lllllllllillMIIM IMIIIIM 

1251 AGAAGAATATAATCGCTTGANTCAACAGCAACCGCCAAAAACTGAAAAAC 1300 

1313 CAGCTC CTGCACCAAAAACA 1332 

Mil I :| I MIMMII 

1301 CAGCACAACCATNTACTCCAAAAACA 1326 
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BESTFIT pspC.EF6796 and pspC.D39. pspC of EF6796 is the top line (bp 1- 
bp 1042)and pspC of D39 is the bottom line (bp 44 - bp 1087). The 
sequences are highly homologous in the upstream noncoding region and the 
DNA encoding the proline region. 

Percent Similarity: 88.322 Percent Identity: 86.065 

" " " • . 

1 AAGCTTATGCTTGTCAATAATCACAAATATGTAGATCATATCTTGTTTAG 50 

lllllllllllllllllllllllllllllllllllllllllllillMII 

44 AAGCTTATGCTTGTCAATAATCACAAATATGTAGATCATATCTTGTTT 93 
51 GACAGTAAAACATCCTAATTACTTTTTAAATATTTTACCrK^ 100 

IIIIIIIIMIIIIIIIIMIIMIIIIIIilil I IIIIMIMIIII 

94 GACAGTAAAACATCCTAATTACTTTTTAAATATTCTTCCTGAGT^^ 143 
101 GCTTGACCTTGTTGAGTCATGCCTATATGACTTTTGTTTO 150 

MMMIIIIIIIIIMIIIII III IIIIIIIIIIIIIIIIIIIIIM 

144 GCTTGACCTTGTTGAGTCATGCTTATGTGACTTTTO 193 
151 GTTTATGCAGTTATTTTGTATCGACGAATAGCTGAAGAGGAAAAGTTATT 200 

ilillMMIIIIIIIIIIIIIINIIIIIIIIIIIIillllill Mil 

194 GTTTATGCAGTTATTTTGTATCGACGAATAGCTGAAGAGGAAAAGCTATT 243 
201 ACATGAAGTTATAATCCCAAATGGAAGCATAAAGAGATAAATACAAAATT 250 

Mllllllllllllllllllilllllllllllllllllllllllllllll 

244 ACATGAAGTTATAATCCCAAATGGAAGCATAAAGAGATAAATACAAAATT 293 
251 CGATTTATATACAGTTCATATTGAAGTGATATAGTAAGGTTAAAGAAAAA 300 

IINMMMIilllllllllllllll lllllllllllllillllllll 

294 CGATTTATATACAGTTCATATTGAAGTAATATAGTAAGGTTAAAGAAAAA 343 
301 ATATAGAAGGAAATAAACATGTTTGCATCAAAAAGCGAAAGAAAAGTACA 350 

IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIilllll 

344 ATATAGAAGGAAATAAACATGTTTGCATCAAAAAGCGAAAGAAAAGTACA 393 

351 TTATTCAATTCGTAAATTTAGTATTGGAGTAGCTAGTGTAGCTGTTGCCA 400 

llllllllilllllllllllllllllllllhlillllllllllllllll 
394 TTATTCAATTCGTAAATTTAGTATTGGAGTANCTAGTGTAGCTGTTGCCA 443 

401 GCTTGTTCTTAGGAGGAGTAGTCCATGCAGAAGGGGTTAGAAGTGGGAAT 450 

,,1111 IN I II II Mill I I I II :|||| 

444 GTCTTGTT ATGGG AAGTGTGGTTCATGC . . SACCAGARAAACGARGGAAG 491 

451 AACCTCA CGGTTACATCTAGTGGGCAAGATATATCGAAGAAGTATG 496 

I'll III INI I I I I I I I I I II 

492 TACCCAAGCAGCCMCTTCTTCTAATATGGCAAAGACAGAACATAGGAAAG 541 
497 CTGATGAA GTCGAGTCGCATCTAGAAAGTATATTGAAGGATGTC 540 

l = i I II mil 11 IMIII II MM III I 

542 CYGCTAAACMAGTCGTCGATGAATATATAGAAAAAATGTTGAGGGAGATT 591 
541 AAAA7UVAATTTGAAAAAAGTTCAACATACCCAAAATGTCGGCTTAATTAC 590 

II I II I II IIIMIIIIIIIIIIIII Mill I 

592 CAACTAGATAGAAGAA AACATACCCAAAATGTCGCCTTAAACAT 635 

591 AAAGTTGAGCGAAATTAAAAAGAAGTATTTGTATGACTTAAAAGTTA 637 
IIIMIIIIII Mllhll llllllllil III Mil: ill:"' 
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636 AAAGTTGAGCGCAATTANAACGAAGTATTTGCGTGAATTAANTGTTNTAG 685 

638 ATGTTTTATCGGAAGCTGAGTTGACGTCAAAAACAAAAGAAACAAAAGAA 687 

I I III h: lllllll nil lllll lllll 

686 AAGAGAAGTCGAANNATGAGTTGCCGTC AGAAATAAAAGCG 726 

688 AAGTTAACCGCAACTTTTGAGCAGTTTAAAAAAGATACATTACCAACAGA 737 

Mini III llllllh lllllllllllllllllll I 

727 AAGTTAGACGCCGCTTTTGANAAGTTTAAAAAAGATACATT GAA 770 

' • - • • 

738 ACCAGAAAAAAAGGTAGCAGAAGCTCAGAAGAAGGTTGAAGAAGCTAAGA 787 

Mill I lilllllllhllllll llllllhllllllllllllllll 

771 ACCAGGAGAAAAGGTAGCNGAAGCTAAGAAGAANGTTGAAGAAGCTAAGA 820 
788 AAAAAGCCGAGGATCAAAAAGAAAAAGATCGCCGTAACTACCCAACCATT 837 

hlllllhllllllllllllll llllllhllllllllllllllll I 

821 AWAAAGCCRAGGATCAAAAAGAAGAAGATCGYCGTAACTACCCAACCAAT 870 

838 ACTTACAAAACGCTTGAACTTCAAATTGCTCAGTCCGATGTGGAAGO^ 887 

lllhllllllllllll lllllllllllllll|:|IMIII lllllll 
871 ACTTRCAAAACGCTTGACCTTGAAATTGCTGAGTyCGATGTGAAAGTTAA 920 

888 AAAAGCGGAGCTTGAACTAGTAAAAGTGAAAGCTAAGGAATCTCAAGACG 937 

I lllllllllllllllllllllhl I lll||:::|||:||| lllll 
921 AGAAGCGGAGCTTGAACTAGTAAARGAGGAAGCTMMRGAAYCTCGAGACG 970 

938 AGGAAAAAATTAAGCAAGCAGAAGCGGAAGTTGAGAGTAAACAAGCTGAG 987 

II IMIIIIIIII lllllll Mill II III I Mill II I Mllllll 

971 AGGAAAAAATTAAGCAAGCAAAAGCGAAAGTTGAGAGTAAAAAAGCTGAG 1020 
988 GCTACAAGGTTAAAAAAAATCAAGACAGATCGT GAAGA 1025 

MIMIMMM MM IIIIIMIIIIhll Mill 

1021 GCTACAAGGTTAGAAAACATCAAGACAGATNGTAAAAAAGCAGAAGAAGA 1070 
1026 AGCTAAACGAAAAGCAG 1042 

Ihllllllllllllll 

1071 AGNTAAACGAAAAGCAG 1087 
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Amino acid comparison of PspC of EF6796 and D39. The size of the PspC 
molecule of D39 is 1/3 the size of PspC of EF6796. PspC.D39 does not 
appear to conatain a signal sequence. 

Percent Similarity: 71.212 Percent Identity: 67.803 

306 SQTEHRKD VDEYIKKMLSEIQLDRRKHTQNVNLNIKLSAIKTKYLY 351 

-•Mill* lllll-IIMIIIIIIIIillMlilllli Mil 

2 AKTEHRKAAKXVAmEYIEKMLREIQLDRRKHTQNVAIiNIKLSAIXTKYLR 51 
352 ELSVLKENSKKEELTSKTKAELTAAFEQFKKiyrLKPEKKVAEAEKKVEEA 401 

II I -M MM-.jl-MII •IIIIIMh-MMM MM 

52 ELXVXEEKS.XXELPSEIKAKLDAAFXKFKKDTLKPGEKVAEAKKXVEEA 100 
402 KKKAKDQKEEDRROTPrmTKTLELEIAESDVKVKEAELELVKEEA^ 451 

I M MMIIIIIIMII llhlllll II nil IM MM Ml I I 

101 KXKAXDQKEEDRRNYPTtmCKTU^LEIAEXDVKVKEAELELVKEEAXEXR 150 
452 NEEKIKQAKEKVESKKAEATRLEKIKTDRKKAEEEAKRKAEESEKKAAEA 501 

:||IIIIIMIIIIIIMIIIMIII Mini IIIM-:| 

151 DEEKIKQAKAKVESKKAEATRLENIKTDXKKAEEEXKRKAAEEDK 195 



552 SKLDTKKAKLSKLEELSDKIDELDAEIAKLEVQLKDAEGNNNVEAYFKEG 601 

M |.||. 

196 VKEKPAEQ 203 

* • • • • 

602 LEKTTAEKKAELEKAEADLKKAVDEPETPAPAPQPAPAPEKPAEKPAPAP 651 

..:.^lll IIIMM 
204 PQPAPXTQPEKPAPKPEKPAEQPKAEK 230 

652 EKPAPAPEKPAPAPEKP . APAPEKPAPAPEKPAPTPETPKT 691 

I I- I- -I-- -• =--:| -Mil- I Nil 

231 TDDQQAEEDYARRSEEEYNRLXQQQPPKTEKPAQ.PXTPKT 270 
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Comparison of the amino acids of PspC.D39 and PspA.Rxl. There is little 
homology except in the proline region. PspA.Rxl is the top line (aa 91 
- aa 444); PspC.D39 the bottom line (aa 2 - aa 270). 

Percent Similarity: 56.767 Percent Identity: 42.857 

91 AKKDAKNAKKAVEDAQKALDDAKAAQKKYDEDQKKTEEKAALEKAASEE^ 140 
||.: :.| I I I : . :.,: |.:| .:..||: |. 

2 AKTEHRKAAKXWD EYIEKMLREIQLDRRKHTQNVALNIKLSAIX 46 

• • • • . 

141 DKAVAAVQQAYIAYQQATDKAAKDAADKMIDEAKKREEEAKTKFNTVRAM 190 

•I I = .1 I : :.| II : 

47 TK YLREIOCVXEEKSXXELPSEIKAKLDAAFXKF, . .KKD 82 

191 WPEPEQLAETKKKSEEAKQKAPELTKKLEEAKAKLEEAEKKATEAKQKV 240 
.:...!.:IMI Mil M : .11:. :l:. 

83 TLKPGEaCVAEAKlbcN^EAKxkAXD QKEEDRRNYPTNTXKTL 123 

. 241 DAEEVAPQAKIAELENQVHRLEQELKEIDESESEDYAKEGFRAPLQSKLD 290 
I I :.|: I I :.:| I :.|. . II .(:: 
124 DLEIAEXDVKVKEAELEL. .VKEEAXEXRDEEKIKQAK AKVE 163 

291 AKKAKIiSKLEELSDKIDELDAEIAKLEDQLKAAEENNNVEDYFKEGLEKT 340 
■ill. .:|h: 

164 SKKAEATRLENI 175 

341 lAAKKAELEKTEADLKKAVNEPEKPAPAPETPAPEAPAEQPKPAPAPQPA 390 

.|.|.: h .-I-:! ..| |||M.||| .||. 

176 ... . KTDXKKAEEEXKRKAAEEDK VKEKPAEQPQPAPXTQPE 213 

391 . PAPKPEKPAEQPKPEKTDDCX^AEEDYARRSEEEYNRLTQQQPPKAEKPA 439 

iilliillilllhllllllllllllllllMlllil IIIIIMIII 

214 KPAPKPEKPAEQPKAEKTDDQQAEEDYARRSEEEYNRLXQQQPPKTEKPA 263 

440 PA. .PKT 444 
Ill 

264 QPXTPKT 270 
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